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CHAPTER | 
The calculus of variations 


|.1 Introduction 


In this final block of MS324 we introduce two related mathematical entities, 
which are of use in many areas of applied mathematics and are of immense 
importance in formulating the laws of theoretical physics. These are the 
functional and variational principles. The theory of these entities is called 
the calculus of variations. 


A functional is a generalisation of a function. A real function of a single real 
variable maps an interval of the real line to real numbers; for instance, the 
function f(x) = 1/(1+27) maps the whole real line to the interval (0, 1]; the 
function Inz maps the positive real axis to the whole real line. Similarly, a 
real function of n real variables maps a domain of R” into the real numbers. 


A functional maps a given class of functions to real numbers. A simple 
example of a functional is 


Sly] = / dxy'(e)?, y(0)=0, y(1) =], (1.1) 


which associates a real number with any real function, y(x), satisfying the 
boundary conditions and for which the integral exists. Here y/(x) = dy/dz, 
and throughout this block we use the notation f’(x) to denote the derivative 
of f(x) with respect to its argument when f(z) is a function of a single 
variable. The square bracket notation S{y] is used to emphasise the fact 
that the functional depends upon the choice of function used to evaluate the 
integral. Later in this chapter we shall see that a wide variety of problems 
can be described in terms of functionals. 


Real functions of n real variables can have various properties; for instance 
they can be continuous, they may be differentiable or they can have station- 
ary points and local maxima and minima. Functionals share many of these 
properties. In particular, the notion of a stationary point of a function has 
an important analogy in the theory of functionals and this gives rise to the 
idea of a variational principle, which arises when the solution to a problem 
is given by a function that makes a given functional stationary. Variational 
principles are very common and important in the natural sciences. 


A simple example of a variational principle is that of finding the shortest 
distance between two points. Suppose that the two points lie in a plane, 
with one point at the origin, O, and the other at point A with coordinates 
(1,1), and that y(az) represents a smooth curve passing through O and A. 
The distance between O and A along this curve is given by the functional 


) 
Sly] = / di «fi FOP a0) = 0 y(1) = 1. (1.2) 
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This result is proved in Section 1.2, where we also prove the physically 
obvious result that the shortest distance is given by the straight line joining 
O to A. This may seem like using a sledgehammer to crack a walnut, but 
the methods introduced are important parts of the calculus of variations 
and can be used to solve more general and difficult problems. 


Variational principles are important for three principal reasons. First, many 
problems are naturally formulated in terms of a functional and an associated 
variational principle. For instance, the shape of a closed curve of given length 
that encloses the largest area is defined by a variational principle. When 
the closed curve is on a flat surface, this becomes a traditional problem that 
was first posed and solved by Zenodorus (c. 180 BC), but for curves on other 
surfaces the solution is not so easy. Other examples are the shape assumed 
by a hanging chain, which is determined by minimising the functional for 
the energy, and the path taken by a ray of light, which minimises the time 
of passage. These, and other, variational principles will be described later 
in this chapter. 


Second, most equations of mathematical physics can be derived from vari- 
ational principles. This is important partly because it suggests a unifying 
theme in our description of nature and partly because such formulations 
are independent of any particular coordinate system, so making the essen- 
tial mathematical structure of the equations more transparent and easier 
to understand. We explore some of these aspects in Chapter 4, where we 
reformulate Newton’s equationsof motion as a variational principle. 


Finally, variational principles provide powerful computational tools; this use 
is not considered here. 


The structure of this block is as follows. In this introductory chapter we 
first consider the particular variational principle defining the shortest dis- 
tance between two points in a plane. This simple example is used to in- 
troduce many of the important ideas associated with functionals and also 
the notation required to deal with the general theory. Chapter 1 ends with 
a discussion of some of the problems that can be formulated in terms of 
variational principles. 


Chapter 2 contains the main theory, where we deal with a general class 
of variational principles described by the Euler-Lagrange equation. We 
develop and generalise the ideas introduced here, in order to obtain necessary 
conditions for a function to make this class of functionals stationary. 


Chapter 3 is in two parts. In the first part we develop a minor generalisation 
of the theory described in Chapter 2, which is required later in order to 
reformulate Newton’s equations as a variational principle. In the second 
part we consider how some general properties of functionals can impose 
constraints upon the behaviour of the stationary functions. For instance, 
this theory shows how momentum conservation in a mechanical system is 
caused by the invariance of the associated functional under translations in 
space; similarly, energy conservation is a consequence of the invariance of 
the associated functional under translations in time. 


In Chapter 4 we discuss the application of Newton’s equations to simple 
mechanical systems and show how this task is often made easier using La- 
erange’s formulation. We also show that, under certain widely applicable 
circumstances, Newton’s three laws can be expressed as a simpler variational 
principle, called Hamilton’s principle. 
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Exercise I.1 


Determine the value of the functional 


1 
Stu) =f aey'(e)*, (0) =0, y(t) =1 
for the following functions: 


(a) y(v)=2 
(b) y(x) = x* where a > $ 


(¢). y(a) = sini /2) 


1.2 The shortest distance between 
two points 


The distance between two points on a given surface can be expressed in 
terms of a functional depending on the path between the two points and the 
properties of the surface. The simplest surface is a two-dimensional plane, 
and we deal with this case here. 


In this section two significant results are derived. First, we derive the func- 
tional for the distance between two points in a plane along a given smooth 
curve. Second, we show that the stationary path is a straight line. 


It is well known that the shortest path between two points in a plane is the 
straight line joining them; indeed, it is Euclid’s definition of a straight line. 
However, it is almost always easiest to understand a new idea by applying it 
to asimple familiar problem, so here we introduce the ideas of the calculus of 
variations by finding the equation of this line. The algebra may seem over- 
complicated for this simple problem, but far more complicated problems 
may be solved with very little extra effort. 


1.2.1 The distance between two points on a given 
curve 


First, we need an expression for the length of a curve between two given 
points P, and P, in a plane, having the Cartesian coordinates (a, A) and 
(b, B), respectively. If the curve is represented by y = f(x) witha< 2 <b, 
then, since it passes through P, and P,, we must have f(a) = A and f(b) = 
B, as shown in Figure 1.1. We also assume that f(x) is differentiable for 
a ee 


Figure 1.1 The graph of an arbitrary curve passing through P, and Py, with 
coordinates (a, A) and (b, B), respectively. 


See Book I of Euclid’s 
Elements, Definition 4. 
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We assume that the curve joining P, and P, is described by a single-valued 
function f(x), i.e. for each x in the interval a < x < b, f(x) has one and 
only one value. ‘This means that curves such as those shown in Figures 1.2 
and 1.3 are not considered, because they are not the graphs of single-valued 
functions. 


Figure 1.2 Figure 1.3 
Divide the interval between a and b into N equal-length segments (7,1, Xx), 
k= 1,2,...,N, where @ =a ane 7 =o SO 

Tei1=%e+6x and Ndx =b-a, (ha) 


as shown in Figure 1.4. 


LO L1 L2 L3 L4 L5 eo 


Figure 1.4 Diagram showing the subdivision of the interval [a,b] into N equal 
intervals, where 441 — ©, = 02, k = 0,1,..., N — 1; in this example N = 5. 


The curve is now approximated by a continuous curve C'y comprising N 
straight-line segments, the kth segment joining the points (x,_1, yr_1) and 
(Te, Ye), Where y, = f(r,). The length of the original curve is defined to be 
the limit as N — oo of the length of C'y. Thus we need to determine the 
length of each segment of C’y, to add these together and then to take the 
limit as N — oo. 


First, we find the distance along the straight line between (xz, y,) and 
(Ck41,Ye+1). Since yz, = f(xz), k =0,1,...,N, are the points defining Cy 
the relation between successive values of y is obtained using Taylor’s expan- 
sion, 
Yeti = fle + 6x) 
= yn + dr f'(az) + O(6a"), yx = f (ak). (1.4) 


The length of the required segment, denoted by ds,z, is given by Pythagoras’ 
theorem, as illustrated in Figure 1.5. 


A more rigorous definition 

of the length allows for the 
increments along the z-axis 
to be different lengths. 
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Yk+1 


Lk Le+1 r 


Figure 1.5 


The length 6s; is determined by the end points: 
bsp = (@e41 — tk)” + (Yeti — YR)? 
= dx? + (f"(xp) dx + O(5a?))” = (1+ f'(a)*) 62” + O(623), (1.5) 


where we have used equation (1.4) to write yp41 — ye = f'(xp)dx + O(6x7). 
On taking the square root of each side of this equation, we obtain an ex- 
pression for 0s; 


6s, = 62 1+ f' (xp)? + O(6z) 


O(6x) 
= dx \/1+ f'(xp)?4/1 + ————— 1.6 
and hence, using the binomial expansion for the right-hand side, 


5s, = dx,/1 + f'(xp)? + O(6z"). (EF) 


The approximate distance from (a, A) to (b, B) along the curve y = f(z) is 
given by the sum of all the segment lengths: 


N-1 N-1 
Sift S— bsp = ide /1+ fl(z_)2 + O(N6z). (1.8) 
k=0 k=0 
Since Néxz = b — a, as dx — 0 this sum becomes the integral 


b 
sii= | a Ge ah te i) = B: (1.9) 


The distance between the two points P, and P,, along the curve y = f(x), 
is given by the above expression: for each suitable real function, f(x), the 
integral has a numerical value denoted by S|f], the functional for the dis- 
tance. For instance, if P, is at the origin and P, = (1,0) the distance from 
P,, to P, along the straight line joining these points is 1, whereas along the 
semi-circle of which P,P; is a diameter, the distance is 7/2. 


The functional (1.9) defines the distance between the points (a, A) and (b, B) 
along a given curve passing through these points. In the next subsection we 
consider how the length of the path varies with f(x) and we shall define the 
notion of a stationary function; we also show that the straight line through 
the end points is a stationary function. This may seem an elaborate method 
of deriving the fourth definition given in Euclid’s Elements, particularly since 
we have already used this definition to derive said functional. However, the 
method of deriving this functional and finding its stationary paths can easily 
be generalised to more difficult problems; for instance, it may be used to 
derive functionals for the shortest distance between two points on curved 
surfaces. Some examples of such problems are discussed in Exercises 1.26 
and 1.27. 
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Exercise |.2 
If P, is at the origin and P, = (b,0), where b > 0, show the following. 


(a) If y= f(x) is the straight line joining P, to Py, then the integral (1.9) has the 
value b. 


(b) If y = f(x) is a semi-circle with diameter P,P,, so that f(x)? = bx — x”, show 
that 


ae i 1 
si=5 f in To. 


By putting x = bsin? ¢, show that S = rb/2. 


1.2.2 The stationary distance 


We require the equation defining the functions for which S|[y] is stationary, 
that is the stationary paths. However, as in ordinary calculus, the idea of 
a stationary point of a function is preceded by the notion of the rate of 
change of functions. Similarly, in the calculus of variations we first need to 
consider the change of S[y] with changes in y; then we can define the idea of a 
stationary path and, hopefully, find these paths. These ideas are introduced 
here, and developed in Chapter 2, using the analogy of the stationary points 
of functions of many real variables. 


You will recall from the discussion about stationary points of a function of n 
variables (Block 0, Subsection 2.2.7) that we define a stationary point to be 
one at which the values of the function at all neighbouring points are ‘almost’ 
the same as at the stationary point. To be precise, if G(a)is a suitably 


behaved function of n real variables, x = (21, %2,...,%n), we compare values 
of G at x and the nearby point a + €€, where |e| < 1 and |€| = 1, 
— 0G 
G(x + e€) — G(a) =e ag O(e). (1.10) 
k 
oe | 


A stationary point is defined to be one for which the term O(e) is zero 
for all €. This gives the familiar conditions for a point to be stationary, 
namely 0G/Ox, = 0 for k = 1,2,...,N. 


For a functional we proceed in the same way. That is, we choose two adjacent 
paths joining P, to P, and compare the values of S along these paths. If 
one is represented by a differentiable function y(x), adjacent paths may be 
represented by y(a) + eg(x), where € is a real variable and g(x) is another 
differentiable function. Since both paths pass through P, and Py, we require 
y(a) = A, y(b) = B and g(a) = g(b) = 0; otherwise g(x) is arbitrary. The 
difference 


6S = Sly + eg] — Sly! (1.11) 


may be considered as a function of the real variable ¢, for arbitrary y(x) 
and g(x) and for small values of |e]. When ¢ = 0, 6S = 0, and for small 
le| we expect 6S to be proportional to €; this is true in general, as seen in 
equation (1.17) below. 


However, there may be some paths for which 6S is proportional to e?, rather 
than «. These paths are special, and we define them to be the stationary 
paths, stationary curves or stationary functions. Thus, a necessary condition 
for a path y(x) to be a stationary path is that 


Sly + eg] — Sly] = O(€*) (1.12) 


In real analysis bold 
characters are often used 
to denote sets of variables; 
this notation will be used 
frequently in this block, 
particularly in Chapter 3. 


Definitions of the symbols 
< and > are given in the 
Handbook. 
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for any sufficiently well behaved function g(x) satisfying g(a) = g(b) = 0. 
The equation for the stationary function y(x) is obtained by examining this 
difference more carefully. 


The distances along these two adjacent curves are 


b 
sul= | io gar (1.13) 


and 


b 
Chee / dz JI + Wi@ +g @Pe. (1.14) 


We proceed by expanding the integrand of S|y + eg] in powers of €, retaining 
only the terms proportional to «. One way of making this expansion is to 
consider the integrand as a function of € and to use the Taylor series to 
expand in powers of ¢; in this manipulation x is treated as a constant, and 
hence so are y’(x) and g(x). Thus 


d 
Viti a= ly? +e|s Fura + O(€") 


/ rYyoF 
= _/) 4a eee ha ae + O(e) 
gf i ie ee | 
/ / 
= /S/1+y'(a)? + gE aD ep O(e?). (1.15) 


L+y(x) 


Substituting this expansion into the integral gives 


. ey’ (x)g' (x 
Sly + €g] =| dx ( Teva + Pee | Oe) 
; y' (a) 


= Sly] +e j dx Fiayae! + O(e*). (1.16) 


The difference between the two distances is therefore 


Sly+ eg| — Sly] = € poe i + O(€*). (1.17) 
a wit 

This difference depends upon both y(x) and g(x), just as for functions of 

n real variables the difference G(a + «€) — G(x), equation (1.10), depends 

upon both x and €, the equivalents of y(z) and g(x) respectively. 


If S[y]| is stationary it follows, by definition, that 


[e UE) ae (1.18) 
¢ —————. g (x) = 0, 

ae «a/lty'ley 

and that this equation must hold for all functions g(x) for which g(a) = 
g(b) = 0 and g’(x) is continuous; we shall see in the next chapter that these 
properties are sufficient to determine y(a) uniquely. Here, however, we sim- 
ply show that if 


y'(x) 
Ly 
then the integral in equation (1.18) is zero for all g(x). Thus, if equa- 
tion (1.19) is true then equation (1.18) becomes 


= a = constant, (1.19) 


b 
/ dx ag'(x) = alg(b) — g(a)] =0_ since g(a) = g(b) = 0. (1.20) 


In Section 2.3 we shall show that condition (1.19) is necessary as well as 
sufficient for equation (1.18) to hold. 
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Rearranging equation (1.19) gives y'(x) =m, where m is a constant. Inte- 
gration now gives the general solution, 


y(x) = mxz+c, (127) 


for another constant c: this is the equation of a straight line, as expected. 
The constants m and c are determined by the conditions that the straight 
line passes through P, and P: 


B-A Ab — Ba 
= —— ——_—, 1.22 
y() == —2 + (1.22) 
This analysis shows that the functional S{y] defined in equation (1.9) is 
stationary along the straight line joining P, to Py. We have not shown that 


this gives a minimum distance: this is done in Exercise 1.7. 


Exercise 1.3 


Show that the equation of the straight line, y = mx +c, passing through the distinct 
points (a, A) and (b, B) is given by equation (1.22). Why does this equation fail if 
b=aand AF B? 


Exercise |.4 


Use the stationary path defined by equation (1.22) in the ery (1.9) to show 


that along this path the distance between P, and P, is S = ,/(b— a)? + (B-— A)? 


Exercise 1.5 


Use the method described in Subsection 1.2.2 on the functional 


Sul= f de VTF¥@), 40) =0, y(t) =B>-1 


to show that on a stationary path y’(x) = constant. Hence show that the stationary 
function is the straight line y(a) = Bz and also that the value of the functional on 


this line is S[y] = /1+ B. 
Exercise |.6 
Consider the functional 
Sly] = [ dxxy'*, y(1)=0, y(2) = 1. 
Show that 
Sly + eg] — Sly] = 2e [ da xy'g' + O(€*) 


and, employing the same analysis as used to derive equation (1.22), show that, if 
g(1) = g(2) = 0, the stationary path satisfies ry’(x) = constant. Integrate this 
equation to show that the stationary function is y(x) = Ina/In2. 
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1.2.3 Geodesics 


Problems involving shortest distances on surfaces other than a plane illus- 
trate other features of variational problems. For example, if we replace the 
plane by the surface of a sphere then the shortest distance between two 
points on the surface can be shown to be the length of an arc of the great 
circle joining the two points, i.e. the circle created by the intersection of 
the spherical surface and the plane passing through the two points and the 
centre of the sphere. Now, for most points, there are two stationary paths 
corresponding to the long and the short arcs of the great circle. However, 
if the points are at opposite ends of a diameter, there are infinitely many 
shortest paths. This example suggests that solutions to variational problems 
can be complicated. 


In general, the stationary paths between two points on a surface are called 
geodesics. For a plane surface the only geodesics are straight lines; for a 
sphere, most pairs of points are joined by just two geodesics, i.e. both 
segments of the great circle through the points. For other surfaces there 
may be several stationary paths; an example of the consequences of such 
complications is described next. 


Gravitational lensing 


In Ejinstein’s general theory of relativity, the path taken by light from a 
source to an observer is along a geodesic on a surface in a four-dimensional 
space. In this theory gravitational forces are represented by distortions to 
this surface. The theory therefore predicts that light is ‘bent’ by gravi- 
tational forces, a prediction that was first observed in 1919 by Eddington’s 
measurements of the positions of stars during a total solar eclipse. ‘These ob- 
servations provided the first direct confirmation of Einstein’s general theory 
of relativity. 


The departure from a straight-line path depends upon the mass of the body 
between the source and observer. If it is sufficiently massive then two or 
more images may be seen, as illustrated schematically in Figure 1.6. 


Quasar image 
OP, 


——— 
Quasar image 


Light paths 


Figure 1.6 Diagram showing, schematically, how an intervening galaxy can 
sufficiently distort a path of light from a bright object, such as a quasar, to provide 
the two stationary paths and hence the two images shown. Many examples of 
such multiple images, and more complicated but similar optical effects, have now 
been observed. Usually there are more than two stationary paths. 
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1.2.4 The shortest path 


~ In this subsection we show that the straight line given by equation (1.22) 
gives a minimum distance. This involves the expansion of S|y + eg] to Oter). 
The general theory that describes the behaviour of this second-order term 
is difficult, and it is not part of this course; the material of this subsection 
is therefore not assessed. 


First, recall that for functions of n variables the nature of a stationary point 
is deterinined by making a second-order Taylor expansion about a point 
f= w, 


Gla +&) = Ga )+ > oe aHeDy ae E,€; +--+, (1.23) 


je=n], gue] 


where all derivatives are evaluated at x = a. This is a generalisation of the 
three-dimensional Taylor expansion given in Block 0 (Subsection 2.2.7). If 
G(a:) is stationary at 2 = a then all first derivatives are zero. The nature of 
the stationary point is usually determined by the behaviour of the second- 
order term. For a stationary point to be a minimum it is necessary for the 
quadratic term to be strictly positive for all €, i.e. 


re 
i> O - for all £.. 8, f= 1,35. 5% 1.24 
ad Bx,0a; °° or all €% 585, J n (1.24) 


provided |€| = 1. The stationary point is a maximum if this quadratic form 
is strictly negative. Usually for large n it is difficult to determine whether 
these inequalities are satisfied, although there are well-defined tests. The 
relevant test for n = 2 is given in Chapter 2 of Block 0. 


For a functional we proceed in the same way: the nature of a stationary path 
is detérmined by the second-order expansion. If S[y] is stationary then, by 
definition, 


Sly + eg] — Sly] = Kly, gle” + O(€*) (1.25) 


for some quantity K[y,g], depending upon both y and g; examples of this 
expansion are found in Exercises 1.7 and 1.8. Then Sly] is a minimum if 
‘ y,g\ > 0 for all g(x), and a maximum if K[y, g] < 0 for all g(x). Normally 
it is dificult to establish these inequalities, and the necessary general theory 
is not part of this course. For the functional defined by equation (1.9), how- 
ever, the proof is straightforward; the following exercise guides you through 


Exercise |.7 


(a) Using either the binomial expansion or a Taylor expansion show that 
a Be Me ee 
Aga, . Cine 


‘b) Use this result to show that if y(a) is the straight line defined in equation (1.22) 
and Slyi is the functional (1.9), then 


Jit (ate)? = V1+a? +4 + O(e). 


e* B-A 


b 
Sly +eol~ Sl = sea anem f dea)”, m= 5 


x: 


Deduce that the straight line is a minimum distance between P, and P,. 


|.3 Two generalisations 


Exercise 1.8 
In this exercise we consider the functional defined in Exercise 1.5 in more detail. 


By expanding the integrand of Sly + eg] to second order in ¢€, show that if y(x) is 
the stationary path, then 

za 
8(1 + Bs? 
Hence deduce that the path y(x) = Bx, B > —-1,0< a <1, is a maximum of this 
functional. 


Sly + eg] = S{y] — / dx g'(x)*+ O(e?), B>-1. 


1.3 Two generalisations 


In this section we extend the preceding analysis to deal with more gen- 
eral functionals. First, we consider functionals having integrands of the 
form F(y’), where F' is any suitably well-behaved, real function of a single 
real variable. By setting F = \/1+ y’?, we regain the example treated in 
Subsection 1.2.2. Second, we generalise this by considering integrands of 
the form F'(x,y'), which depend explicitly upon xz and y’; an example is 


et ey? 


1.3.1 Functionals depending only upon (x) 


The functional (1.9) (page 9), depends only upon the derivative of the un- 
known function. Although this is a special case, it is worth considering it in 
more detail in order to develop the notation we need. 


If F(z) is a function of z, which we need to be differentiable, then a general 
functional of the form (1.9) is 


b 
sul= [ drF(’), yla)=4, y)=B, (1.26) 


where F'(y’) simply means that in F(z) all occurrences of z are replaced by 
y'(x). Thus in the previous example 


F(z)=V1+22 so Fly)=VJ14+y(z)?. (h.2P) 
Note that the symbols F'(y’) and F'(y'(x)) denote the same function. 


The difference between the functional evaluated along y(xz) and the adjacent 
paths y(a) + eg(x), where |e] < 1 and g(a) = g(b) = 0, is 


b 
Sy +e] ~ Sy] = f ae [Fy +e9')- FW), (1.28) 


a 


Now we need to express F'(y’ + €g’) as a series in €; because F(z) is differ- 
entiable, Taylor’s theorem gives 


F(z+eu) = F(z) + cu F(2) + O(e*). (1.29) 


The expansion of F'(y’ + €g’) is obtained from this simply by replacing z by 
y' (x) and u by g/(x), which gives 
d 


Pig += Fo) = oe (y’) + O(€*), (1.30) 
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where the notation dF'/dy’ means 

—F(y') = —F(z) (1.31) 
z=y' (x) 
For instance, if F(z) = V1+ 2? then 

ee: dF y' (x) 


— = ——_ and  — = ———. 1.32 

dz /14 22 dy’ = ,/1+ y'(x)? — 
Exercise 1.9 
Find the expressions for dF'/dy’ for the following cases of F'(y’): 
(a) Fly’) =(1+y'?)4 
(b) F(y’) =siny’ 
(c) F(y’) = exply’) 
Substituting the difference (1.30) into the equation (1.28) gives 

b 
d 
sy +eg\— Sul =f dry) FW!) +0). (1.33) 


By definition the functional S{y] is stationary if the term O(e) is zero for all 
suitable functions g(a). We now give a sufficient condition, deferring until 
Chapter 2 the proof that it is also necessary. In this analysis it is important 
to remember that F(z) is a given function and that y(x) is an unknown 
function that we need to find. Observe that if 

d 


art a = a = constant, (1.34) 
Y 


then equation (1.33) becomes 


b 
Sly +g] — Sly] = ea / dx g{(«) + O(2) 


a 


= ea(g(b) — g(a)) + O(€*) 


= O(e’). (1.35) 
In general, equation (1.34) is true only if y’() is also constant, so 
y(x) = maz-+c. (1.36) 
Therefore 
B-A Ab— Ba 
if 


using the boundary conditions y(a) = A and y(b) = B. 


This is the same solution as given in equation (1.22). Thus, for this type of 
functional, the stationary function is independent of the form of the inte- 
grand although the nature of the stationary function is not; see for instance 
Exercise 1.22 (page 29). The exception is when F(z) is linear, in which 
case the value of S[y] depends only upon the end points and not the values 
of y(x) in between; then there is no stationary path, as shown in the next 
exercise. 


Since g(a) = g(b) = 0. 


|.3 Two generalisations 


Exercise 1.10 


If F(z) = Cz+D, where C and D are constants, show that equation (1.34) does 
not imply that y'(a) = constant. 


In this case show that the value of the functional Sy =f dx Fy’) is independent 
of the chosen path. 


Note that this is the only example where the equation dF'/dy’ = constant does not 
imply that y’(a) = constant. 


1.3.2 Functionals depending upon x and 4'(zx) 


Now consider the slightly more general functional 


b 
Sly] = / dn F(a,y'), M@=Ay gO) =, (1.38) 


where the integrand F(x,y’) depends explicitly upon the two variables x 
and y’, which are treated as independent variables. The difference in the 
value of the functional along adjacent paths is 


b 
Sly + eg] — Sly] = / dx F(a, y’ +g’) — F(z,y’')| (1.39) 


In this example F(x, z) is a function of two variables and we require the 
expansion 


F(x2,z+eu) = F(z, z) teu om ftw’ (1.40) 


where Taylor’s series for functions of two variables is used. Comparing this 
with the expression preceding equation (1.30) we see that the only difference 
is that the total derivative with respect to y’ has been replaced by a partial 
derivative. As before, on replacing z by y’(x) and u by g’(x), equation (1.39) 
becomes 


b 0 
Sly + eg] — Sly] =e / tx g(0) 5 Flay) + OC), (1.41) 
where 
O O 
ap F ew) = ge] (1.42) 


If y(x) is the stationary path, it is necessary that 


: O 
/ dx g(x "aa -F(x,y')=0 for all g(z). (1.43) 


As before a sufficient condition for this is that OF /Oy’ = constant, which 
gives a first-order differential equation for y(x), 


ag y)=c, ya)=A, yO) =B, (1.44) 
where c is aconstant. This is the equivalent of equation (1.34), but now the 


explicit presence of x in the equation means that y'(x) =constant is not a 
solution. 
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Exercise I.11 


Consider the functional 


1 
Sly] =| de,fl+a2+y*, gW=A>O, gij=8> A. 
0 
Show that the function y(x), defined by the relation 


y (2) =er/1+e+y9 ey, 


where c is a constant, makes S[y] stationary. By expressing y'(x) in terms of 2, 
solve this equation to show that 


1.4 Notation 


In the previous sections we used the notation F'(y’) to denote a function of 
the derivative of y(x) and proceeded to treat y’ as an independent variable, so 
that the expression dF’/dy’ had the meaning defined in equation (1.31). This 
notation and its generalisation are very important in subsequent analysis; it 
is therefore essential that you are familiar with them and can use them. 


Consider a function F(x, u,v) of three variables, for instance F = xrVu? + v?, 
and assume that all necessary partial derivatives of F'(x,u,v) exist. If y(x) 
is a function of x we may form a function of x by replacing u by y(a) and 
v by y’(x), thus 


F(z,u,v) becomes F(z, y,y’). 


Although this is a function of x, it is more often convenient to consider 
it as a function of three independent variables (x,y, y’). The first partial 
derivatives with respect to y and y’ are simply 


(1.45) 


a eae 
a = BP (ee ¥) 


u=y,v=y' 


and 


(1.46) 


0 0 
ee”: = 
Oy! (x,y, y') Aye (es Uv) 


u=y,v=y' 
Because y depends upon x, we may also form the total derivative of F(z, y, y’) 
with respect to x using the chain rule 
dF OF OF dy OF dy 
dx Ox Oydzx Oy' dz 
= —— + —4'(zx) a ay!” (21. (1.47) 


In the particular case F(x, u,v) = 2Vu? + v’, with u=y and v =y/, these 
rules give 


OF OF ry OF vy! 
Seceeice iio. cae ad ee —S = sw——————__, 1.48 
Ox Yy To Oy ly2 + y!? Oy’ fy? 4 yf 2 ( ) 
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Similarly, the second-order derivatives are 


er _ OF OF _ PF 
Oy? Ou ver Oy! Ove u=y,v=y" 
O°F O° F 
q - 1.4 
5 OyOy’ Oudv oe a 


Because you must be familiar with and be able to use this notation we sug- 
gest that, before proceeding, you attempt as many of the following exercises 
as time permits. 


Exercise 1.12 
For the functions 

Fy(z,y) =sin(e+y), Fo(a,y)=cos(x+y*) and F3(x,y) = exp(zy), 
where y = y(x), find OF;,/Ox, OF;,/Oy and dF), /dx, for k = 1, 2,3. Also, show that 


d (ah) , 0 (dh 
dx \ Oy’ Oy’ \ dx }- 
Exercise 1.13 


For the function F = ,/x? + y’?, find 
OF OF OF dF 


dz’ Oy’ Oy” da 
Also, show that 


d (OF\ _ oO (dF 
dz \ Oy’) Oy’ \ dx) © 
Exercise 1.14 


For the function F = ,/x? + yy’?, find 
ee oF nd 
Ox’ Oy’ Oy! dx 


Exercise 1.15 
Show that for an arbitrary differentiable function F(z, y, y’) 
da OF = rr . OPPs , 0° F 
dx Oy! cs Byl2? AyOy! ” Oxdy! 
Hence show that 

d (a) , 0 (ak 

dx \ Oy! oy \ dx j’ 


with equality only if F does not depend explicitly upon y. 


Exercise 1.16 


Use the first identity found in Exercise 1.15 to show that the equation 


a (aF\_ OF _, 
dx \ Oy’ Oy 


is equivalent to the second-order differential equation 


Fr. ri Or . @F OF | 
By?” * Gyay’” * axdy’ dy 

Note: the first equation will later be seen as crucial to the general theory described 
in Chapter 2. The fact that it is a second-order differential equation means that 


unique solutions can be obtained only if two initial or two boundary conditions are 
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given. Also the fact that the coefficient of y’(x) is 07 F/Oy’? is very important in 
the general theory of the existence of solutions of this type of equation. 


Exercise 1.17 


(a) IF FYy,y')=yVl+y", find This is a hard exercise 
OF OF Ryan a2F which you are advised to 


sal bee do for revision. 
Oy’? Oy'’ dy'2’ — Aydy’ 
and show that 


d OF OF Pas 12 —3/2 2d y’ = 
= (3) 5 7 he ) (v =(5) t). 


(b) By solving the equation y?(y'/y)/ = 1 show that a non-zero solution of 


d (OF OF A 
= (= Dy 0 is y 7 08 (Az + B), 


for some constants A and B. 


[Hint: let y be the independent variable and define a new variable z by the 
equation yz(y) = dy/dx to obtain an expression for dy/dx that can be inte- 
grated. 


1.5 Some examples of functionals 


In this section we describe a variety of problems that can be formulated 
in terms of functionals, with solutions that are stationary paths of these 
functionals. ‘This list is provided because it is likely that you will not be 
familiar with these descriptions and will be unaware of the wide variety of 
problems for which variational principles are useful, and sometimes essential. 
You should not spend long on this section if time is short; in this case you 
should aim at obtaining a rough overview of the examples. Indeed, you may 
move directly to Chapter 2 and return to this section at a later date, if 
necessary. 


In each of the following subsections, a different problem is described and 
the relevant functional is written down; some of these are derived later. In 
compiling this list one aim has been to describe a reasonably wide range of 
applications: if you are unfamiliar with the underlying physical ideas behind 
any of these examples, do not worry because they are not an assessed part 
of the course. Another aim is to show that there are subtly different types of 
variational problems, for instance the isoperimetric and catenary problems, 
described on pages 22 and 25, respectively. ‘hese variants are not considered 
further in this course, but you should know of their existence. 


|.5 Some examples of functionals 


1.5.1 The brachistochrone 

Given two points P, = (a, A) and P, = (b, B) in the same vertical plane, as 
illustrated in the diagram below, we require the shape of the smooth wire 
joining P, to P, such that a bead sliding on the wire under gravity, with no 
friction, and starting at P, with a given speed shall reach P, in the shortest 
possible time. 


Pp 


Figure 1.7 The curved line joining P, to P, is a segment of a cycloid, as shown in 
Section 2.7. In this diagram the axes are chosen to give a= A= 0. 


The name given to this curve is the brachistochrone, which comes from the 
Greek brachystos, meaning shortest, and chronos, meaning time. 


If the y-axis is vertical, it can be shown that the time taken along the curve 
y(x) 


wt of dy - 
Tyl= | ae @ —aegey y(a)=A, y(b) =B, 


where g is the acceleration due to gravity and C’ a constant depending upon 
the end points and the initial energy of the particle. This expression is 
derived in Section 2.7. 


(1.50) 


This problem was first considered by Galileo in his 1638 work Two new sci- 
ences, but lacking the necessary mathematical methods he concluded, erro- 
neously, that the solution is the arc of a circle passing vertically through P,. 


It was Bernoulli, however, who made the problem famous when in June 1696 
he challenged the mathematical world to solve it. He followed his statement 
of the problem by a paragraph reassuring readers that the problem was very 
useful in mechanics, that it is not the straight line through P, and P, and 
that the curve is well known to geometers. He also stated that he would 
show that this is so at the end of the year, provided no one else had. 


In December 1696 Bernoulli extended the time limit to Easter 1697, though 
by this time he was in possession of Leibniz’s solution, sent in a letter dated 
16 June 1696, Leibniz having received notification of the problem on 9 June. 
Newton also solved the problem quickly, apparently on the day of receipt, 
and published his solution anonymously. 


The curve giving this shortest time is a segment of a cycloid, which is the 
curve traced out by a point fixed on the circumference of a vertical circle 
rolling, without slipping, along a straight line. The parametric equations of 
the cycloid shown in Figure 1.7 are 


x=a(O—sin@), y= -—a(1—cos8), (-L01.) 


where a is the radius of the circle: these equations are derived in Subsec- 
tion 2.7.1. 
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Galileo Galilei, 1564-1642. 


Johann Bernoulli, 
1654-1705. 


Gottfried Wilhelm Leibniz, 
1646-1716. 


Further details of this 
history and these solutions 
may be found in Chapter 1 
of H. H. Goldstine, A 
history of the calculus of 
variations from the 17th 
through the 19th century 
(Springer-Verlag, 1980). 
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1.5.2 Minimal surface of revolution 


Here the problem is to find a curve y(x) passing through two given points 
P, = (a, A) and P, = (b, B), with A > 0 and B > 0, as shown in Figure 1.8, 
such that when rotated about the x-axis the area of the curved surface 
formed is a minimum. 


Figure 1.8 Diagram showing the cylindrical shape produced when a curve y(2), 
joining (a, A) to (b, B), is rotated about the x-axis. 


The area of this surface is shown in Section 2.5 to be 


b 
Sig} = an f dxy(x)\/1+y'(ax)?, (1.52) 


and we shall see that this problem has solutions that can be expressed in 
terms of differentiable functions only for certain combinations of A, B and 
b—a. 


1.5.3 A problem in navigation 


Given a river with straight, parallel banks a distance a apart and a boat 
that can travel with constant speed c in still water, the problem is to cross 
the river in the shortest time, starting and landing at given points. 


If the y-axis is chosen to be the left bank, the starting point to be the origin 
and the water is assumed to be moving parallel to the banks with speed 
v(x), a known function of the distance from the left-hand bank, then the 
time of passage along the path y(x) is, assuming c > max(v(2z)), 


VETE Twp - 
v= [ae on eee , (1.53) 


with boundary conditions y(0) = 0 and y(a) = A, where the final destination 
is a distance A along the right-hand bank. The derivation of this result is 
set in Exercise 1.28, one of the harder exercises at the end of this chapter. 


1.5.4 The isoperimetric problem 


Among all curves, represented by functions with continuous derivatives, that 
join the two points P, and P, in the upper half plane and have given length 
L, determine the one that encompasses the largest area, Aly], shown in 
Figure 1.9. 
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a b 


Figure 1.9 Diagram showing the area Aly] under a curve of given length joining 
F, t0.f,. 


This is a classic problem discussed by Pappus of Alexandria in about AD 300. 
Pappus showed that of two regular polygons having equal perimeters the one 
with the greater number of sides has the greater area. In the same book he 
demonstrates that for a given perimeter the circle has a greater area than 
does any regular polygon. This work seems to follow closely the earlier work 
of Zenodorus (c. 180 BC): extant fragments of his work include a proposition 
that of all solid figures, the surface areas of which are equal, the sphere has 
the greatest volume. 


This intuitively obvious result is enshrined in the myth describing the foun- 
dation of the Phoenician city of Carthage in 814Bc: this is that Dido, also 
known as Elissa, having fled from Tyre after her brother, King Pygmalion, 
had killed her husband, was granted by the Libyans as much land as an ox- 
hide could cover. By cutting the hide into thin strips, she was able to claim 
far more ground than anticipated: this early display of deviousness was con- 
sidered by the Greeks and Romans a Punic trait. As with all foundation 
myths there is no trace of evidence for its veracity. 


Returning to Figure 1.9, a modern analytic treatment of the problem re- 
quires a differentiable function y(x) satisfying y(a) = A, y(b) = B, such 
that the area, 


b 
Aly) = f deu(e) (1.54) 


is largest when the length of the curve, 


p= [uw J1+y'(a)?, (1.55) 


is given. It transpires that a circular arc is the solution. 


This problem differs from the first three because an additional constraint — 
the length of the curve — is imposed. In this course we do not consider this 
type of problem, although another such example is described in Section 1.5.6. 


1.5.5 Fermat’s principle 


Light, and other forms of electromagnetic radiation, is a wave phenomenon. 
However, in many common circumstances light may be considered to travel 
along lines joining the source to the observer: these lines are called rays and 
are often straight lines. This is why most shadows have distinct edges and 
why eclipses of the Sun are so spectacular. 
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In a vacuum, and normally in air, these rays are straight lines and the 
speed of light is c ~ 2.9 x 10!¥cms™!, independent of its colour. In other 
uniform media, for example water, the rays also travel in straight lines, 
but the speed is different: if the speed of light in a uniform medium is cp, 
then the refractive index is defined to be the ratio n = c/cm. The refrac- 
tive index usually depends on the wavelength: thus for water it is 1.333 for 
red light (wavelength 6.5 x 107-°cm) and 1.343 for blue light (wavelength 
7.5 x 10~-°cm); this difference in the refractive index is one cause of rain- 
bows. In non-uniform media, in which the refractive index depends upon 
the position, light rays follow curved paths. Mirages are one consequence of 
a position-dependent refractive index, where light passes through layers of 
air of differing densities. 


A simple example of the ray description of light is the reflection of light in 
a plane mirror, as shown in Figure 1.10. In the diagram the source is S and 
the light ray is reflected from the mirror at R to the observer at O. The 
plane of the mirror is perpendicular to the page and it is assumed that the 
plane SRO is in the page. 


Figure 1.10 Diagram showing light travelling from a source S to an observer at O, 
via a reflection at R. The angles of incidence and reflection are defined to be 6; 
and 62, respectively. 


Light travels in straight lines and is reflected from the mirror at a point 
R as shown in the diagram, but without further information the position 
of R is unknown. Observations, however, show that the angle of incidence, 
6,, and the angle of reflection, 02, are equal. This law of reflection was 
known to Euclid (c. 300 BC) and Aristotle (384-322 BC), but it was Hero of 
Alexandria (c. 125 BC) who showed by geometric argument that the equality 
of the angles of incidence and reflection is a consequence of the Aristotelian 
principle that nature does nothing the hard way. That is, if light is to travel 
from the source S to the observer O via a reflection in the mirror then it 
travels along the shortest path. 


This result was generalised by the French mathematician Fermat into what Pierre de Fermat, 
is now known as Fermat’s principle, which states that the path taken by (1601-1665). 
light rays is that which minimises the time of passage. 


Fermat’s original statement was that light travelling between two points 
seeks a path such that the number of waves is equal, as a first approxi- 
mation, to that in a neighbouring path. This formulation has the form of 
a variational principle, which is remarkable because Fermat announced this 
result in 1658, before the calculus of either Newton or Leibniz was developed. 


For the mirror, because the speed along SR and RO is the same, this just 
means that the distance along SR plus RO is a minimum. If AB = d and 
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AR =z the total distance travelled by the light ray depends only upon x 
and is 


f(x) = 4/22 +h? + 4/(d—2)? + hi. (1.56) 


This function has a minimum when 6; = 62, that is when the angle of in- 
cidence, 0;, equals the angle of reflection, 02. This result is derived in 
Exercise 1.20. 


In general, for light moving in the Oxy-plane, in a medium with refractive 
index n(x, y), with the source at the origin and observer at (a, A) the time 
of passage, T, along an arbitrary path y(x) joining these points is 


Tig = an dzn(x,y)4/ Py?) (py =0;" g(a) = A. (1:57) 


c 


This follows because the time taken to travel along an element of length 0s 
is n(x, y)ds/c and 6s = \/1+y'(x)? 6x. If the refractive index, n(z,y), is 
constant, this integral reduces to the integral (1.9) and the path of a ray is 
a straight line, as would be expected. 


Fermat’s principle can be used to show that for light reflected at a mirror 
the angle of incidence equals the angle of reflection. For light crossing the 
boundary between two media it gives Snell’s law, 

sin Q1 Cj 


Sa 1.58 
sinagg 2. (1.58) 


where a; and a» are the angles between the ray and the normal to the 
boundary and cz, k = 1,2, are the speeds of light in the media, as shown 
in Figure 1.11. For example, in water the speed of light is approximately 
C2 = c,/1.3, where c; is the speed of light in air, so 1.3sinaz = sina. 


Figure |.1! Diagram showing the refraction of light at the surface of water. ‘The 
angles of incidence and refraction are defined to be ag and a, respectively; these 
are connected by Snell’s law. 


In this figure the observer at O sees an object S in a pond and the light 
ray from S to O travels along the two straight lines SN and NO, but the 
observer perceives the object to be at S’, on the straight line ON. This 
explains, for instance, why a stick put partly into water appears bent. 


1.5.6 The catenary 


A catenary is the shape assumed by an inextensible chain of uniform density 
hanging between supports at both ends. In Figure 1.12 we show an example 
of such a curve when the points of support, (—a, A) and (a, A), are at the 
same height. 


Willebrord Snell, 
1591-1626. 
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—@ a 4 6 


Figure 1.12 ‘The catenary formed by a uniform chain hanging between two points 
at the same height. 


If the lowest point of the chain is taken as the origin, the catenary equation 
is 


y =e (cosh (=) - 1) . (1.59) 


for some constant c determined by the length of the chain and the value 
of a. 


If the shape of the chain is described by the differentiable function y(x) it 
can be shown (see Exercise 1.25) that the potential energy F of the chain is 
proportional to the functional 


a 

a5 = Re (1.60) 
—a 

The function, y, that minimises this functional, subject to the length of the 

curve L = [ a dz \/1+ y’? remaining constant, is the shape assumed by the 

hanging chain. 


1.5.7 Coordinate-free formulation of Newton’s 
equations 


Newton’s second law for an isolated system of N interacting particles relates 
the acceleration of the kth particle to the force acting upon it from all the 
other particles, 
2 

my Tk = Fy(r.r2,.--sN) Ze & sere - (1.61) 
where m, is the mass of the kth particle at r;,. ‘This law of motion accurately 
describes a significant portion of the physical world, from the motion of 
large molecules to the motion of galaxies. However, when formulated in this 
manner a number of difficulties arise, some practical and some theoretical, 
the latter concerning the underlying structure of this system of nonlinear 
equations and their solutions. — 


More general systems, affected by external forces, can also be described by 
Newton’s equations and we use one such system to illustrate the practical 
difficulties frequently encountered when using Newton’s laws as originally 
formulated. This system is the simple pendulum, constrained to move in a 
vertical plane, as shown in the following diagram. Here the mass at P is 
attached to the point O, taken to be the origin, by a light rod of length J, 
swinging freely, that is without friction, about O in the plane of the paper. 
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mg 
Figure 1.13 Diagram showing the geometry of the simple pendulum. 


If the y-axis points vertically upwards, the position of the mass at P has the 
coordinates r = I(sin@,—cos@), where the angle @ is defined in the figure. 
The external gravitational force acting on the mass is F = (0,—mg). In 
addition there is the tension of magnitude T in the rod, responsible for 
ensuring that the distance OP is constant. Because the length OP is fixed 
the coordinates of P depend upon only one parameter, 6, but Newton’s 
equations of motion involve the two coordinates of r and the two forces F' 
and T’. 


Often we are not interested in the tension because it does not affect the 
motion, until it exceeds the breaking strain of the rod. Thus, ideally, we 
require a more ‘natural’ formulation of the equation of motion for 6 which 
does not involve T. Also, observe that the vector velocity and acceleration 
of P in Cartesian coordinates are 


r = 1(0cos0, 6 sin 6) (1.62) 
and 


Pad (G cos — 9? sin 6, @ sin 6 + 6? cos 0) (1.63) 


The acceleration depends on 0, @ and @ in a fairly complicated manner; in 
more complex problems the equivalent algebra is more complicated and adds 
additional and unnecessary difficulties to the formulation of the equations 
of motion (see Chapter 4). 


In this example the tension TJ is referred to as a constraining force — be- 
cause it constrains the distance OP to be constant. Another example of a 
constraining force is the reaction between a curved surface and a particle 
moving smoothly on it: in this case the constraint stops the particle falling 
through the surface. 


Constraining forces are clearly of physical importance, but it is often helpful 
to use a formulation of Newton’s equations that avoids their use. By using a 
variational principle, not only are these forces avoided but so are the difficul- 
ties encountered when expressing 7 in non-Cartesian coordinates. Further- 
more, the variational formulation highlights the underlying mathematical 
structure of Newton’s equations, which allows sophisticated methods to be 
used to help solve the equations. It also provides guides to the structure of 
the equations needed when Newton’s laws are not valid, as for example when 
speeds approach that of the speed of light (the relativistic limit), or when 
the momentum of particles become very small, that is atomic and nuclear 
systems, when quantum mechanics is needed. In this course, in particular 
in Chapter 4, we lay the foundations for the development of some of these 
ideas. 
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1.6 Summary and Outcomes 


These few examples provide some idea of the significance of variational prin- 

ciples. In summary, they are important for three distinct reasons: 

e A variational principle is often the easiest or the only method of formu- 
lating a problem. 

e A variational formulation provides a coordinate free method of express- 
ing the laws of dynamics, allowing powerful analytic techniques to be 
used in ordinary Newtonian dynamics. They also pave the way for the 
formulation of dynamical laws describing motion of objects moving at 
speeds close to that of light (special relativity) and particles interact- 
ing through gravitational forces (general relativity) and the laws of the 
microscopic world (quantum mechanics). 

e Often conventional boundary-value problems may be reformulated in 
terms of a variational principle; this provides a powerful tool for approx- 
imating solutions. (This theory is not covered in this course.) 


After studying this chapter you should: 

e have an idea of what a functional is and some appreciation of their 
applications; 

e beable to determine the conditions required of a differentiable path that 
makes functionals of the form 


b b 
sui= | dx F(y') or sul= | dx F(z, y’) (1.64) 


stationary; 
e be able to perform elementary calculations on the partial derivatives of 
functions of three variables. 


1.7 Further Exercises 


Exercise 1.18 


Functionals do not need to have the particular form considered in this chapter. 
The following expressions, where a(x) and b(x) are prescribed functions, also map 
functions to real numbers. 


(a) Diy] = y'(Q) 


>) Kul= | “dza(2) [y(2) + u(Du'@) 


(c) Llu) = [ev(eyy'(e)] + fae [aCw)y'(w) + Wo)u(a)| 


Find the values of these functionals for each of the functions y(2) = x? and y(x) = cos 7x 
when a(x) = x and W(x) = 1. 


Exercise 1.19 


If F(y,y’) = exp(ay + y’), where a is a constant, show by direct calculation that 


hes cal a ne except if a = 0 
dx \ Oy’ Oy’ \ dx r ae 
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Exercise 1.20 


Show that the function 


f(a) = 4/2? + hi + 4/(d—2)? + hg, 


where hy, he are defined in Figure 1.10 (page 24) and x and d denote the lengths 
AR and AB respectively, is stationary when 0; = 62 where 
d—2x 


My 
7 are sin 02 = ——— 
Vf a? + hz /(d— ax)? + hs 


Show that at this stationary value f(x) has a minimum. 


sin al = 


Exercise |.21 


Show that the functionals 


b b 
si) = | dz(1+ay')y and Satu) = | iar 


where b > a> 0, y(b) = B and y(a) = A, are both stationary on the same curve, 
namely 


y(t) = A+(B—A) ES | 


Explain why the same function makes both functionals stationary. 


Exercise 1.22 


Show that the functional 


1/4 


Sul= f de(1+y'@)"", v)=0, y(t) =8, 


is stationary for the straight line y(x) = Bz. 


In addition, show that this straight line gives a minimum value of the functional 
only if B< /2, while for B > V2 it gives a maximum. 


1.8 Harder Exercises 


Exercise 1.23 


Consider the functional 


stu] = f dry’V/1+y’, y(0)=0, y1)=B>-1. 


(a) Show that the stationary function is the straight line y(x) = Bx and that the 
value of the functional on this line is Sly] = BVY1+ B. 


(b) Use the binomial expansion to expand the integrand of Sly + eg] to second 
order in € to show that if y(a) = Bz, 


(443) [ 2 ae 
ont aaa or ee a 
sa +B? J, @I) 


Deduce that on this path the functional has a minimum. 


Sly + eg] = Sly] + 
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Exercise |.24 


In this exercise the theory developed in Subsection 1.3.1 is extended. The function 
F(z) has a continuous, non-constant second derivative and the functional S is 
defined by the integral 


b 
Syl = / dz Fly’). 


(a) Show that 


‘ b 2 
i } Ff 
Sly + eg] — Sly] = | dx ays + se | dx 5 ofan} + er); 
where ofa) = g(b) =0. 


(b) Show that if y(x) is chosen to make dF'/dy’ constant then the functional is 
stationary. 


(c) Determine the conditions that make the functional either a maximum or a 
minimum. 


Exercise 1.25 


A uniform, flexible, inextensible chain of length L is suspended between two sup- 
ports having the coordinates (—a, A) and (a, A), with the y-axis pointing vertically 
upwards. Show that, if the shape assumed by the chain is described by the differ- 
entiable function y(x), then its length and potential energy are given respectively 
by 


ee / dn fl +g? aa Sip ee / Pane ee 


a —@ 


where p is the line density of the chain and g the acceleration due to gravity. 


Exercise |.26 


This question is about the shortest distance between two points on the surface of a 
right-circular cylinder, so is a generalisation of the theory developed in Section 1.2.1. 


(a) If the cylinder axis coincides with the z-axis we may use cylindrical polar 
coordinates (p,, 2) to label points on the cylindrical surface, where p is the 
cylinder radius. Show that the Cartesian coordinates of a point (x,y) in the 
plane are given by x = pcos@, y = psing and hence that the distance between 
two adjacent points on the cylinder, (p, ¢, z) and (p,¢+ 6¢, z + 62) is, to first- 
order, given by 


6s? = poo" Ge”. 


-b) A curve on the surface may be defined by prescribing z as a function of ¢. 
Show that the length of a curve from ¢ = ¢; to @ = ¢2 1s 


2 
Llal= fo ad VP aOP. 


(c) Deduce that the shortest distance on the cylinder between the two points 
(p,0,0) and (p,a,¢) is along the curve z = ¢¢/a. 
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Exercise |.27 


An inverted cone has its apex at the origin and axis along the z-axis. Let a be 
the angle between this axis and the sides of the cone, and define a point on the 
conical surface by the coordinates (p,¢), where p is the perpendicular distance to 
the z-axis and ¢ is the polar angle measured from the z-axis, as in Exercise 1.26. 


Show that the distance on the cone between adjacent points (p,@) and (p + 6p, @ + 
dd) is, to first-order, 


dp? 
6s? = 0°60" + as 
sln @ 


Hence show that if p(¢), ¢1 < p < ¢2, is a curve on the conical surface then its 
length is 


2 a! 2 
Lil= | doy/p2 + 
‘ sin“ a 


Exercise |.28 


A straight river of uniform width a flows with velocity (0,v(x)), where the axes 
are chosen so the left-hand bank is the y-axis and where v(x) > 0. A boat can 
travel with constant speed c > max(vu(x)) relative to still water. If the starting and 
landing points are chosen to be the origin and (a, A), respectively, show that the 
path giving the shortest time of crossing is given by minimising the functional 


2 C7 '(e)*) — u(x)? — v(ax)y' (ax 
r= fae VTE EP HW) yo) 20, yan 


c? — v(x) 


Hint: the derivation of this result requires use of the identity 


14+ VE = (1+V¥) (2a VF) = Oe 


31 


32 Chapter | 


Solutions to Exercises in Chapter | 


Solution I.1 
(a) If y=, then y’ = 1 and the functional becomes S = i ge =.1, 


(b) If y=a%, then y’ = ax*~' and the functional becomes 


on i 


Note that the integral does not exist if a < 1/2 because the integrand tends to 
zero too rapidly at the origin. 


1 2 
a 
= o | dxx?t-) = a> 5. 
0 


(c) If y=sin(7a/2), then y’ = (7/2) cos(7a#/2) and the functional becomes 


2 2 


1 9 1 
s=7/ dx cos (2) =F] ds: (1+ cosma) = — 


Solution 1.2 
(a) On this straight line y = 0 so the value of the functional is 5 = J Oak. 
(b) If f? = ba — x? then 2f f’ = b — 2x and the functional is 


(b — 2x)? 

A(ba — A(br — x?) Oe et x) 
Setting « = bsin? ¢, so dx/dd = 2bsindcos@ and a — x) = (bsin¢cos ¢)?, 
gives S|f] = b fi o dd = 7b/2, as is obvious without this analysis. 


Solution |.3 


The line passes through (a, A) and (b, B) so the constants m and c are determined 
from the two equations, 


A=am+c and B=bm-+c. 
Subtracting these gives B — A = m(b— a), giving the required expression for m. 
Substituting this into the first equation gives 
B-A Ab— Ba 
i. ho 
Ifa=band AF B the two points are distinct and there is a straight line joining 


them. This line cannot be represented by an equation of the form y = mz +c, 
however, because on this line x is constant. 


c=A- 


In this case it is necessary to use parametric equation for the line, which is 
g=(b—a)t+a, y=(B-A)t+A, whereO<t<1l, 


and which is valid for all distinct points P, and P,. You are not expected to provide 
this part of the solution. 


Solution |.4 


Since y’ — a) the value of the functional (1.9) 
A xe 3 = _ ¥(o—a)* +(B— A)’ —a)?+(B-— A)? i 
(b— a)? 


a+ (BA). 
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Solution 1.5 


In order to find the stationary function we need to compute the difference 6S = 
Sly + eg] — S[y] only to O(e). However, because we will require the second-order 
term in Exercise 1.8, here we evaluate the difference to O(e*). The difference is 


6S = [ dx (V1 + y!(x) + eg'(x) — V1 + y/(2)) : 


where g(0) = g(1) = 0. But 


eg’ (x ne 
Ve) +e) = vitv@ (14+ “EO 


1+ y'(z) 
2 
eg (x) e ( g' (x) 
= VJ/l+y'(xz)|1+ = Sa Seas tast 1 
ae ( M+y@) 8 \I+y@) 
where we have used the binomial expansion (1 + z)!/2 = 1+ Sz ~ ge +--+, which 


is equivalent to using the Taylor series for (1 + z)1/?. 


Thus to first order in € we obtain 


1 / 

55 = € / g (x) 2). 

S af dx eerie + O(e*) 
The functional is stationary if the first-order term is zero for all g(x), otherwise 65 
would change sign with e. Using the result quoted in the text (after equation (1.19)) 
this gives \/1 + y/(x) = constant, that is y/(xz) = constant and y(x) = ar + 8. The 
boundary conditions, y(0) = 0 and y(1) = B, then give y = Bz for the stationary 
path. With this value for y(x), the integrand is real if B > —1 and has the value 


= oe 2. 


Solution 1.6 
The difference between S[y + eg] and S|[y] is 


2 2 
ey eg) — S|y| = / da x ((y’ a = y’?) = 2¢ | decoy 9! Ole"). 
1 i 


If xy’ = a = constant, this difference is O(€?) and y(x) is a stationary path. 


This differential equation is separable and becomes 
dy 
dx 
But y(1) = 0, so b= 0 and y(2) = 1, so a= 1/1n2, hence the required solution. 


— — with general solution y=b+alnz. 
2 


Solution 1.7 


(a) The required expansion is given by first writing the square root as 


bw oir <=. of)" 
& Ea € — ——— aa ‘ 
lt+a? 1+a? 
2 


Now use the binomial expansion (1 + z)!/? = 1+ 5z-—$27+--- to give 


=143(7 . 29 )-i( a 28 Aes: 


2\14+c sabe we 8 Phi otk 


Fe: & a? 9" 
=1+ face hee” Gr ee na x1 —.] 4. O(e?). 


But 


lta? (1+a?)? (1+a?)?’ 
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so the final result is 


2 aa 
VIF er aP = Vire (14 8 + +018) 


08 
Vita? 2(1 + a?)3/2 


(b) With a= y'(a) and G = g’(x) we see, using the argument described in the 
text, that the term O(c) in the expansion of S{y + eg] — S[y] is zero if y/(x) = 
constant, hence the line defined by equation (1.22) makes the functional sta- 
tionary. With this choice of y(x), a = m and the second term in the expansion 
derived in part (a) is zero and the quoted result follows. The second-order 
term in the expansion of the functional is positive for e 4 0 and all g(x), so 
the functional has a minimum along this line. 


Solution 1.8 
It was shown in the solution of Exercise 1.5 (page 33) that to second order in e, 
1 / 2 1 / 2 
€ g (x) € g (x) 3 
Sly+eg|—S =s/ da $= - 5 | dx ———_—~ + O(e?”). 
With y = Bz the first-order term on the right-hand side is zero by definition, hence 


2 


1 
6S = —-—_—__—_.- jf dzxg'(r)*?<0,;..B > =1. 
S ain, eee ys > 


This term is always negative, so for sufficiently small |e] we have S|y, + eg] < S|ys], 
where y;(x) = Ba is the stationary path, which is therefore a local maximum. 
Solution 1.9 

(a) If Fly’) = (1+ y!2)"/4, then 


a y’ 
dy’ = 2(1 + y! 2)3/4° 


(b) If F(y’) =siny’, then dF’/dy’ = cosy’. 
(c) Since #(e*) = e* we have dF/dy! = F. 


Solution 1.10 
If F(z) =Cz+D, F’'(z) =C and equation (1.34) gives C =a. In this case the 
functional becomes 


b 
Sly] = / dx (Oy!(#) + D) =C y@) =H@)l + DO~ a). 


This depends only upon C’, D and the boundaries a and b: the value of the func- 
tional is therefore independent of the chosen path. 


Solution I.11 | 
In this example F(z,v) = V1+a+v? and equation (1.44) becomes 
v=cV1lt240v?, 
where v = y/(x). Squaring and rearranging this equation gives 
dy\* r ¢ 
(Z) a ee 


The square root of this equation gives dy/dx = taV1+ 2. Since x > 0, y/(x) 4 0; 
also y(1) > y(0), so y’(x) > 0 and we need the positive sign. Now separate variables 
to write the solution in the form 


Yy z 
/ ww =a | duV1+u, 
A 0 
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which includes the boundary condition at « = 0. Integrating this gives the solution 
in the form 


y(z) - A= 4a (1 oo eel 1) 
The value of a is obtained from the boundary condition y(1) = B, ie. 


.-. eA (B— A) es 
3 = =i ee and hence y(x) = A + (23/2 — 1) ((1 + ag / — 1) : 


Solution 1.12 


We have 
ee sin(z + y) =cos(x+y) and a sin(a + y) = cos(x + y). 


Note that for this function OF /Ox = OF\/O0y. Also, 


dF, OF, OF 
= + =<y! = (1+ y') cos(x + y). 


dx Ox Oy 
For F5 we have 

OF 

Ox 


OF: 
= —sin(x + y’), = = —2ysin(x + y*) 


and 


dF > OF» OF, ih * 2 
fe tw Dy (1 + 2yy’) sin(x + y*) 


For F3 we have 


OF OF 3 dks / / 
— = yF —— = 2F: d —= i= Fs. 
Sia uFs, Gi=aP and Fo =(ytay)Fs= (ey) 
For the last part, we note that the left-hand side is zero since OF, /Oy’ = 0. But 
dF O (dF 
— =(1+y')cos(x+y), so By! () = cos(% + y). 


Solution |.13 


The partial derivatives are 
OF ¥ OF OF y’ 


— i, = Oh, Ae See, 
Ox J x2 + yl?’ Oy Oy’ \/ x2 + y!? 
and the total derivative is 
a ae a xr ya” 


de On Oy" * By” Jat pyt atau? 
Hence 

d (OF yl"! ry’ y! zy" 

iz (ay) ~ rege I VT 


and, using the above expression for dF'/dx 


O dF xy y" y!? // 
dy! ‘= ~ G+ yl BP * Jar ty? (a +y'?)9/? 
Solution |.14 


The partial derivatives are 


OF x OF y'? OF yy’ 


Ox far+yyl2’ Oy 2y/a? + yy’? Oy! \/a2 + yy’? 

The total derivative is 
dt’ - OF aa le 4 OF Le a yyy” 
dx Ox Oy 


Y 
et 
Oy’ Jaz + yy? — 2/a? + yy’? [2 + yy! 2 
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Solution 1.15 
The chain rule applied to a function G(x, y(x), y’(x)) has the form 
dG _ AG dy, AG dy , a6 
dx Oy'dx Oydx Ox’ 
In this example, where G = OF'/0y’, this expression becomes 
d (AF) _ 0 (dF) dy | 0 (OF) dy , a (OF 
dx \ Oy’) Oy’ \ Oy] dx Oy \ Oy] dx Ox \ Oy’ 
EE gi PE og re 
Oy! a7 Oy! Ay” OxOy!’ 


which gives the required expression and is the left-hand side of the inequality. 


The right-hand side of the inequality is 


0 (=) 0 (= Or | ar 3 
ie ms + eee 


dy Vdc) dy’ \Ox * Oy” * dy! 
ae OF eee 
~ Ondy | Oy | OyOy'” | Byl2” 


which differs from the left-hand side by the term OF'/Oy. Thus if F’ is independent 
of y, the derivatives are equal. 


Solution 1.16 
Subtract the term OF'/Oy to obtain the required result. 


Solution |.17 

(a) Direct differentiation gives 
OF OF f 
—=vVl+y"?, = =e 
Oy Oy /1+y'2 

Differentiating the second expression gives 

O° F ———— y Fr y 
Oy!2 Ji +y!2 (1 + y!2)3/2 - (1 + y'2)3/2 


Using the expression derived in Exercise 1.16, namely 


/ 


d (OF\ OF ,@F ,@F OF OF 
°= ae ay) ~ dy ~Y ay? TY Byay By" OP Beay! ~ 
we obtain 
yy” yg” aig 
~ Oy? * eye tH) 
1 2 
= a y357 (yy"” + (1+y'7) —_ (1+y'7) ) 
1 2 
= as yan wv" 1), 
But 
a(4)-4-4 
ae y+ 
giving 
d if 
w"-y?="5 (4), if y #0, 
hence 


a OF dee. ee ol y a 
dx \ Oy’ Oy (1+ y'?)3/2 7 dx y j 


dydy’ I+ yl? 
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(b) If the left-hand side is zero we have 


ote aiyv 
esl he as | 2 a J ma | 
rE(5) 4 wvz(*) 


Now define z = y'/y and consider z to be a function of y, so in the following Note that this is possible 
z' = dz/dy. Now put the second equation in the form yz z’(y) = 1, which can. because x may be 


be integrated directly to give considered a function of y, 
so y’/y can be expressed in 


terms of y. 


for some constant C’. Hence, since z = y’ ; Y, 


dy _ 3 


giving 
Fr oos 
a 1 


Finally, set Cy = cosh ¢ to give ¢ = C(x+ D), i.e. y = (1/C) cosh(C(x + D)), 
which is the required solution if C = A and CD = B. 


c+ D. 


Solution 1.18 
(a) yi) = o*, then 9 = 22 and Dy) = 2. 
If y(x) = cosa, then y’ = —msinwz and Dy] = 0. 


(6) Hala) = 2, then: 
1 
taeh=2*, Ky = / dx a(x* + 2x) = 3; 
0 
1 
fyis)=cosone, Kly\ = / dx x cos Tr = —~5. 
0 
(ec) E ele) = «and Oz) = 1, them 


ige=c, Lgh= [2x4], +f delagty = a: 


1 
if y(x) =cosmz, Lly] = |—Zasin Qna], +f dx (—rxsinrx + cos7xz) = —1. 
0 


Solution 1.19 
Consider the left-hand side, 0F'/Oy’ = F and dF'/dx = (ay’+ y")F. Thus 


a for ee oe {ar row 
= (se) = (wv +y")F and wa (Ge) =0F + ay +o 


Solution |.20 


The derivative of f(x) is f’(x) = 2/./x?2 + h? — (d—2)/\/(d— 2x)? + h3. Since 


AR x RB a + 2 


sind, = ——- = ————=-_ and_=Ss sin 6g = —- = ———————_., 
© SRO PTR RO” (d= a +3 
where the distances are defined in Figure 1.10 (page 24), we see that the distance 


travelled by the light is stationary when sin @; = sin 62, that is 0; = 62. Further, 
since 


Ge Re haa ap eee 


the stationary point is a minimum. 


0, 
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Solution |.21 
Observe that 


b 
ied ed +f Ce a ee: ae 


that is, the values of the two functionals differ by a constant, independent of the 
path. Hence the stationary paths of the two functionals are the same. 


Consider the difference 6 = S2[y + eg] — S2ly], where g(a) = g(b) = 0: 


b 
§ = 2e | dx xy’ (x)g' (x) + O(e?) 


so that 6 = O(c?) if xy'(x) =c, where c is a constant. Integrating this equation 
gives y(x) = d+ cln(a#/a), where d is another constant. The boundary conditions 
now give 


A=d and B=d+cln(b/a) 


and hence 


y(t) =A+(B-A) aire | 


Solution 1.22 


In this example F(y') = (1 + y/?)'/4, and is a function of y’ only, so the stationary 
path is a straight line, y= mx+c, with the constants m and c being chosen to 
satisfy the boundary conditions. These give y = Bz. Also, 

OF y’ q vr 2-7" 

— = —————_., and — > = ——— 7; 

Oy’ 2(1 + y!2)3/4 Oy! ? A(1 + y!2)7/4 
Using the result derived in Exercise 1.24, and the fact that y’ = B on the stationary 
path, we see that 
(2 — B*)e? 


1 
7m | dx g'(x)* + O(e*). 


Sly + eg] — Sly] = 84 B24 


Thus if B < 2 the difference is positive for all g(a) and e, if sufficiently small, so 
the functional is a minimum along the line f(x) = Bx. For B > V2 the difference is 
negative and the functional is a maximum. If B = 2 the nature of the stationary 
path can be determined only by expanding to higher order in e. 


Solution 1.23 


(a) We need the difference 6 = S[y + eg] — S[y], where g(0) = g(1) = 0, otherwise 
g(a) is an arbitrary continuous function. Now, using the binomial expansion 


ED a 
Vl+a+e= = + 37 +a) a - 25 +0), 


SO 


= ED ep 
(a+ €B)/l+a+e8= Qu ViFa(1+ 50 2) St) 


+ eva (14 ea ae te) 


6B(2+3a) _ 6°(4+3a) 
=aVT+ a4 a 4 et 
A he, ae 8(1 + a)3/2 
Now substitute a = x’ and 3 = g’ to obtain 
1 / 2 i / 
oroe 4 € / | 3 
6=e | dx ———g(xr)+— | de — 359 (2) + Ole). 
fear O+ SL Maa ypaee” +O8) 
If y(x) is a stationary path of S, then the term O(e) is zero. Since g(0) = 


g(1) = 0 it follows, as in the text, that y’(x) = constant is a possible solution. 
Since y(0) = 0 and y(1) = B this gives y(x) = Ba and Sly] = BV1+ B. 
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Solutions to Exercises in Chapter | 


Alternatively, using equation (1.34) (page 16), with F(y’) = y’/1+y’, we see 
that the stationary path is given by F’(y’) = constant and hence y’ = constant, 
that is y= mx+c: since y(0) = 0 and y(1) = B this gives y(x) = Ba. 
(b) On substituting Bax for y(a) we see that 
f4+am: f° 
f= eee. | dag ar $0 eh. 
camer | date) +O(€) 
Then, provided B > —1, 6 is positive and the functional is a minimum on the 
stationary path. 
Solution 1.24 


(a) Consider the difference 6 = Sly + eg] — S[y] where g(a) = g(b) = 0, so we need 
the ‘Taylor expansion 


b b 2 
FP 1 d*F 
6= | dx ay? + ‘s dx aye + Ofe*). 


(b) IfdF/dy' = constant then 6 = O(e*) so S{y] is stationary. Further, if dF/dy’ = 
constant then, provided F(z) is not a constant or a linear function of z, y’(x) 
is also a constant. 


(c) On the stationary path y'(a) is a constant and hence d?F’/dy’” is constant and 


Lyger bene 3 
fe he a7 | dx g(x) + Ole 4 
The integral is positive, so 5 is positive or negative when d?F/dy’? is positive 
or negative. That is, S[y] is either a minimum (d?F/dy’? > 0) or a maximum 
(d?F/dy’* <0). If d?F/dy’? =0 the nature of the stationary path can be 
determined only by expanding to higher order in e. 


Solution 1.25 


The functional L[y] is the length of the chain, given by the integral (1.9) (page 9). 
The potential energy, dV, of an element of the rope of length ds centred on a point 
x is given by mass x height x g, i.e. 6V = (pds)y(x)g. Since 6s = \/1+ y/76zx this 
gives the length of the chain as 


Lul= f dzvV/1+y!? 


=, 


and the total potential energy as 


Ely| = po | dzy/1+y"*. 


=e 


Solution 1.26 


(a) The cylinder radius, p, is constant so, to first order, dz = —pddsin @ and dy = 
pdo@ cos d, the distance is 


: 
6s? = 6x” + dy” + bz? = p*6d? + 52” = 5¢? ( sa @ 


(b) The length along a curve is just the sum of the small elements which in the 
limit 6¢ — 0 becomes the integral 


p2 
La = [ db (P+ 2G. 
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(c) The functional L[z] is the same as that considered in Section 1.3.1 hence its 
minimum value is given when z(@) is a linear function of ¢. The boundary 
conditions give the result quoted. 

Solution |.27 

The Cartesian coordinates of a point (p,@) on the cone are 
(x, Y, z) sa (p COS Q, psin d, p/ tan a) 


and for the adjacent point at (9 + 6p,¢+ 60), or (x + 6x, y + dy, z + 6z) in Cartesian 
coordinates, we have, to first order 


dx = dpcosé— podsing, dy=dpsind+ pddcosd, 6z = dp/ tana. 


The distance between the two adjacent points is therefore 
bp? t. eee 
| 5p? + p26¢? = —E— + p?6¢? = ( +5 @ bd. 
sin* @ 


Hence the distance between the points ¢, and @¢2 along the curve p(¢@) is 


2 pl? 
L|p| =| do 4/ p? + —>—. 
; sin* a 


Solution 1.28 


Let the velocity of the boat relative to the water be (uz, uy), where c? = uz + uy, 
and we assume that wu, is positive. The velocity of the boat relative to land is 
therefore (uz, u(x) + uy). If the path taken is y(x) it follows that 


6s? = (1+ 


tan? a 


2 


dy dy dx Uytv 

dx dt’ dt je. 
Also, the time of passage is 
* oe 


Qo Us 


T|y| = 


4=u+u we 


Now we need an expression for u, in terms of y, v and c. Since c a 


have 


(y'(x)uz —v) =? -v 


which rearranges to the quadratic 

(1+y'7) uz — vy’ uz — (c? — v*) =0, 
having the solutions 
_ oy 2 Veg Pe eee) 
<= ere 


Because c > v this equation has one positive and one negative root. We need the 
positive root: 


vy! + vy (uy’)* + (ec? — v*)(1 + y"*) 
1 4. y! 2 : 
This may be put in a more convenient form by noting that, for any X 


1+ VX = (1+ vX) (ta vF | = 


Uy = 


Hence 
rn 
giving 


pe . oa 


a 1\2 2 — g2)\(] 2h ce / a 2(1 2. ey / 
riy)= [ae (oy? + (8 =) + y?) we Pas She uy 
0 0 
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CHAPTER 2 
The Euler—Lagrange equation 


2.1 Introduction 


In this chapter we derive the most important result of the calculus of vari- 
ations and apply it to a few of the examples described in Chapter 1. We 
show that for the functional 


b 
sul= | dz F(x,y,y'), y(a@)=A, yb) =B, (2.1) 


where F'(x,u,v) is a real function of three real variables, and a necessary 
condition for the function y(x) to be a stationary path is that it satisfies the 
equation 


d (OF OF 
Sy ee Pee —A b)=B od 
£(S)-F-=0 wa =A, 0) =B (2.2) 
where the conditions y(a) = A, y(b) = B are named as the boundary con- 
ditions. This equation is known as either Euler’s equation or the Euler— 
Lagrange equation. 


In order to derive the Euler-Lagrange equation it is helpful to first dis- 
cuss some preliminary ideas. We start by describing briefly Euler’s original 
analysis, because this provides an intuitive method and indicates why vari- 
ational problems are different from those of conventional calculus, and also 
provides a link between the calculus of functions of many variables and the 
calculus of variations. This leads directly to the idea of the rate of change of 
a functional, needed in subsequent analysis. Next we prove the fundamental 
lemma of the calculus of variations, which is essential for the derivation of 
the Euler-Lagrange equation. 


In the last two sections of this chapter, before the end of chapter exercises, 
we apply this theory to two of the problems described in Chapter 1: the 
surface of minimum area and the brachistochrone problems. ‘There are many 
exercise in this chapter; you should attempt as many as time permits, but 
remember that it is unnecessary to attempt them all. 
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2.2 Preliminary remarks 


2.2.1 Relation to differential calculus 


Euler was the first to make a systematic study of problems that can be de- Leonhard Euler, 
scribed by functionals, though it was Lagrange who developed the method 1707-1783. 

we now use. Euler studied functionals having the form defined in equa- Joseph-Louis Lagrange, 
tion (2.1). He related these functionals to functions of many variables using 1736-1813. 

the simple device of dividing the abscissa into N + 1 equal intervals, 


d= 20, 21, 29; +115 BN; DtNAi1 = i, (2.3) 


where 14.41 — 2, = A, and replacing the curve y(x) with segments of straight 
lines having the vertices 


(x0, A), (1,41), (%2,y2), ---, (tn, yn), (@N+1, 8B), (2.4) 


where yz, = y(xx), aS Shown in Figure 2.1. 


Figure 2.1. Diagram showing the rectification of a curve y = f(x) by a series of six 
straight-line segments (N = 5). 


Approximating the derivative at x, by the difference (yz — yz—1)/h, the 
functional (2.1) is replaced by a function of the N variables (y1, y2,..-, yn); 


N+1 
Stunts) =F (sem ASB), (2.5) 


where h = (b—a)/(N +1), yo = A and yn41 = B. This association with 
ordinary functions of many variables can illuminate the nature of functionals 
and, if all else fails, it can be used as the basis of a numerical approximation; 
examples of this procedure are given in Exercises 2.1 and 2.22. The inte- 
gral (2.1) is obtained from this sum by taking the limit N — oo; similarly, 
the Euler-Lagrange equation (2.2) may be derived by taking the same limit 
of the N algebraic equations 0S/Oy, = 0, k = 1,2,...,N. In any mathe- 
matical analysis, however, care is usually needed when such limits are taken 
and this example is no exception; we shall not dwell on these difficulties, 
although they are important in the further development of this subject. 


Euler made extensive use of this method of finite differences. By replacing 
smooth curves with polygonal lines, he reduced the problem of finding sta- 
tionary paths of functionals to finding stationary points of a function of N 
variables; he then obtained exact solutions by taking the limit as N — oo. 
In this sense functionals may be regarded as functions of infinitely many 
variables — that is, the values of the function y(x) at distinct points — and 
the calculus of variations may be regarded as the corresponding analogue of 
the differential calculus. 


2.2 Preliminary remarks 


Exercise 2.1 


If the functional depends only upon y’, 


a) 
Slyl = / dz Fly’), y(a) =A, y(b) = B, 


show that the corresponding approximation defined by equation (2.5) becomes 


si = ee 
Siu.gew)=h|P (24) +r (BH) 4 +e (B a. | 


uN — i B= yy 
+ lil ia teal 2 


Show that a stationary point of this function satisfies the equations 


OS Yk — Yk-1 Yk+1 — Yk 
—n = f | SH |) Le Pee eee eS ae oy ees Be 
Oyy ( h ( h OF y) 9 9 


Use these to deduce that 
ae (Yr — yx—1)/h) =@ b= 7... | 


where c is a constant, independent of k, and provide arguments to show that, pro- 
vided F’(z) is continuous, $(y1, y2,---, yn) is stationary when the points (xz, y(@x)) 
lie on a straight line. 


2.2.2 The differentiation of a functional 


The calculus of variations is primarily about finding stationary paths of 
functionals. The subtle mathematical ideas necessary for the rigorous de- 
velopment of the subject are beyond the scope of this introductory course. 
However, you ought to be aware of some of the potential problems, and in 
the next few paragraphs we try to explain some of these difficulties, which 
are not an assessed part of the course. 


Consider for a moment the analogous problem of finding the stationary 
points of a function, f(z), of a real variable x: it is evident that the concept 
of closeness of two points on the real line and of two values of the function 
are essential. In this case these measures are the same, namely |a — 6] for 
the distance between two real numbers. 


For a function f(a) of n real variables x = (41, %2,...,%m), we need the no- 
tion of the distance between two points in the n-dimensional space of vari- 
ables. Usually, the numbers (71, 22,...,2%n) can be regarded as the Carte- 
sian coordinates of a point in an n-dimensional space, then the distance, 
\z|, of a point 2 from the origin can be defined by the natural extension of 
Pythagoras’ theorem, 


la) = \/a? +a$+---+ 22, (2.6) 


and the distance between two points is |ja — y|. However, alternative defini- 
tions of distance can be used: two are 


Jarl, = [eal + |z2]+-+++|an| and |x|y= max (ail). (2.7) 


= a) ee) 
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Functions, such as these, that define distances in abstract spaces are called 
norms: any function N(a) of the coordinates can be used as a norm provided 
it satisfies the following three rules: 


(a) N(w) > 0 and N(x) = 0 if and only if x = 0. 

(b) Nia —y) = N(y— a). 

(c) N(a—y)+ N(y—z) > N(a — z) (triangle inequality). 

It can be verified that the three norms defined above satisfy these rules. 


Returning to functionals, now the elements corresponding to the points x 
are functions, y(x), belonging to a particular class. For functionals of the 
form 


b 
/ dx F(a,y,y'), y(a) = A, y(t) =B, (2.8) 


we would require functions, y(x), with continuous first derivative that satisfy 
the boundary conditions. One of the crucial differences between the point x 
and a function y(x) is that the former belongs to a finite-dimensional space, 
in the sense that there are at most n independent directions. Functions, 
however, belong to an infinite-dimensional space: this step, from finite- to 
infinite-dimensional spaces, is subtle and important in the development of 
rigorous theory of the calculus of variations, but is not considered in this 
course. 


For the same reason that we need a norm in an n-dimensional Cartesian 
space, we need norms for the distance between allowed functions. There are 
several different norms for functions, and the choice of an appropriate norm 
can be important. An example of a norm for functions continuous on |a, 
is 


|y(2)|| = max |y(c)] (2.9) 


But continuous functions may not be differentiable at some points — or 
indeed at any point — and we need functions that are both continuous and 
have continuous first derivatives. A suitable norm for such functions is 


SS ii 
lly(a) || = =o ly(a)| + 4 ly"(x)|. (2.10) 


In the rigorous analysis of variational principles, the choice of norms is im- 
portant, particularly when trying to determine whether or not stationary 
paths are extrema. In this course we have no need to consider norms any 
further, although the notion is implicit in all that follows. 


It is necessary, however, to restrict the class of functions to a subset of all 
possible functions that satisfy the boundary conditions. Normally we shall 
simply refer to this restricted class of functions as the admissible functions: 
in most circumstances admissible functions are defined to satisfy the bound- 
ary conditions and have continuous first, or possibly higher, derivatives. 


If you are interested in knowing why these details are important, consider 
the optional Exercise 2.37 (page 74) which gives a simple example showing 
why care is needed. Further, in Section 2.5, in particular equation (2.59), we 
shall encounter a real problem for which the path producing the minimum 
value of the functional does not have a continuous first derivative. ‘These 
subtleties are not, however, an assessed part of the course, or required in 
the theory presented here. 


We also need the notion of the rate of change of a functional, because this 
is implicit in the idea of a stationary path. Recall that a real differentiable 
function of n real variables G(a), for # = (71, %2,...,%n), is stationary at a 
point if, at this point, all its first partial derivatives are zero, i.e. OG/Ox, = 0, 


2.2 Preliminary remarks 


k= 1,2,...,n. This result follows by considering the difference between the 
values of G(a) at adjacent points: 


Cle ee Sa 5— +01), él =1, (2.11) 
| | 


where € = (€;,&5,...,&,). The rate of change of G(a) in the direction of € 
is obtained by dividing by e, 

Gee+ £2) = Ce). ~4 ee 

ae — +O 2.12 

Z df Ox, + (€), ( ) 

and taking the limit as e — 0. A stationary point is defined to be one at 
which the rate of change of G is zero in every direction; it follows that at a 
stationary point all first partial derivatives must be zero. 


This idea, embodied in equation (2.12), may be applied to the functional 


b 
Sly] = / dx F(a,y,4'),  y(a) = A, y(d) =B, (2.13) 


which has a real value for each admissible function y(x). The rate of change 
of a functional S[y] is obtained by examining the difference between the 
values on neighbouring admissible paths, i.e. Sly + eg] — Sly]; and, since 
both y(a) and y(a) + eg(x) are admissible functions for all €, it follows that 
g(a) = g(b) = 0. This difference is a function of the real variable €, so we 
define the rate of change of S{y] by the limit 


S —S§ d 
ASly, g| = lim wl Pasha a. ees ae (2.14) 
eal) c de EL 
which we assume exists. The functional AS depends upon both y(x) and 
g(x), just as the limit of the difference [G(a + «€) — G(ax)|/e, of equa- 
tion (2.12), depends upon x and €. 


Definition: The functional S|y] is said to be stationary if y(a) is an admis- 
sible function and if AS|y,g| = 0 for all g(a) which have a continuous first 
derivative and satisfy the boundary conditions g(a) = g(b) = 0. 


This definition generalises the particular example dealt with in Subsec- 
tion 1.2.2, in particular see equation (1.17) (page 11). 


As with functions of many variables, the nature of a stationary path of 
a functional is usually determined by the term O(e?) in the expansion of 
Sly + eg]. The theory that determines whether a stationary path is an 
extremum is difficult and is not contained in this course. 


In all our applications, the limit in equation (2.14) can be evaluated by the 
formula 


d 
de €=0 


Also, in our applications, AS is linear in g. That is, if c is any constant, 
then AS|y, cg] = cAS|y, g]; in this case it is called the Gdteaux differential. 


ASly, 9] = —Sly + €g] (2.15) 


As an example, consider the functional 


b 
Sly] = } dx /I+y”,. y(a)= A, y(b) = B, (2.16) 
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for the distance between (a, A) and (b, B) as discussed in Subsection 1.2.2. 
We have 


er re=d [ae 1+ (y’ + 9’)? 

gee y’ + eg 
d 

-[ da V1 + (y' + eg')° 


— yl@) tea), 
= fa OF? He). age 


Note that we may change the order of differentiation with respect to « and 
integration with respect to x because a and b are independent of € and all 
integrands are assumed to be sufficiently well-behaved functions of x and e. 
Hence, on taking the limit as e — 0, we have 


b / 
ASly,9] = lim {sly + eq = [ ds “2 —9', (2.18) 


Vit yay 


which is just equation (1.18) (page 11). 


Exercise 2.2 


Find the Gateaux differentials of the following functionals. 
1 2 x/2 

(@) Stl=f dey? —() Stul= f dea*y? —(@) Sl= f ae(y?-v") 

0 1 0 

y! 2 b 

fd) Sig] = [ ee b>a > (e) sul | dx (y'* + y* + 2ye”) 
©) Sty) = fae Ve revise 

0 


Exercise 2.3 


Find the Gateaux differentials of the following functionals. 


b b 
i Siok / ae ee / rug” 


b 
3, ae / de (1+ ay!) 


Explain why AS2 = AS3. 


A comment 


For our final comment of this section, we consider the approximation de- 
fined in equation (2.5) in a little more detail to show the sort of problem 
that can arise in the limit as N — oo. By associating the variable €, in equa- 
tion (2.12), with the discrete values of the function g(x) of equation (2.14), 
so € = (91,92,---,9N), Where gp = g(r), we see that the condition |€| = 1 
includes the requirement that g(x) is bounded for a < xz < 6. However, we 
also require that g’(z) exists, but the example gy, = —1/V2, ge+i = 1/ V2 
and g; = 0 for all other components, so that g'(r%) ~ (gk+1 — gx)/h, shows 
that although g remains bounded as h — 0, its derivative is unbounded. ‘This 
example shows why some care is required when we take the limit N — co 
of equation (2.5), why the cases of finite numbers of variables is different 
from the infinite case and why the choice of norms, discussed above, 1s 
important. Nevertheless, provided caution is exercised, the analogy with 
functions of several variables can be helpful. 
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2.3 The fundamental lemma of the calculus of variations 


2.3 The fundamental lemma of the 
calculus of variations 


This section contains the essential result upon which the calculus of vari- 
ations depends. Using the result obtained here, we shall be able to use 
the stationary condition that AS|y,g] = 0, for all suitable g(x), to form a 
differential equation for the unknown function y(2). 


The fundamental lemma 


If z(x) is a continuous function of x for a < x < b and 


b 
/ dz zit igie)=0 (2.19) 


for all functions g(x) that are continuous for a < x < b and zero at 
#=aand ¢ = b, then z(n>=) fora 2d. 


In order to prove this, we assume, on the contrary, that for some 77 satisfying 
a<n<b, z(n) #0. Then, since z(a) is continuous, there is an interval 
[v1, £2] around 7 with 


Gijon < ty Hh (2.20) 


in which z(a) 4 0. We now construct a suitable function g(x) that yields a 
contradiction. Define g(x) to be 


(t—21)(t2-4), axa<r<mwy<bd, 
g(z) = 


7 | 
0, otherwise, ( ) 


so g(x) is continuous and 


b LQ 
/ dz az\gia) = / dx z(x)(x — 41)(xq — x) £0, (2.22) 


1 


since the integrand is continuous and non-zero on [x1, £2]. However, accord- 
; b m © 
ing to equation (2.19), {| dx zg = 0, so we have a contradiction. 


The assumptions that z(#) is continuous and z(x) 4 0 for some x € (a,b) 
lead to a contradiction and it follows that z(a) = 0 for a < x < b; because 
z(x) is continuous, z(x) = 0 fora <a <b. This result is called the funda- 
mental lemma of the calculus of variations. 


This proof assumed only that g(x) is continuous and made no assumptions 
about its differentiability. In previous applications g(x) had to be differ- 
entiable for x € (a,b). However, for the function g(x) defined above, g’(x) 
does not exist at 7; and x2. The proof is easily modified to deal with this 
case. If g(a) needs to be n times differentiable, then we use the function 


g(x) = (x — 2 ills - a 4, Sat. Se Mas 


ae 
0, otherwise. aa 
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2.4 The Euler—Lagrange equation 


This section contains the most important result of this block: if F(x,u,v) 
is a sufficiently differentiable function of three variables, then a necessary 
condition for the functional 


b 
sul = | dz F(x,y,y'), y(a) =A, y(o) =B, (2.24) 


to be stationary on the path y(), is that it satisfies the differential equation 
and boundary conditions, 


d F PF 
oo (=) — > =, igfal = A, wh) =a (2:25) 
This is called Fuler’s equation or the Euler-Lagrange equation. It is a 
second-order differential equation, as shown in Exercise 1.16 (page 19), and 
is the analogue of the necessary condition 0OG/Ox;, = 0, k =1,2,...,n, fora 
function of n real variables to be stationary, as discussed in Subsection 2.2.2. 
We now derive this equation. 


The integral (2.24) is defined for functions y(x) that are differentiable for 
a<a<b. Using equation (2.14), we find that the rate of change of S|y] is 


d b 
ASiy,9\ = | dx F(x,y + €g,y' + €g') 
a e=0 


(2.26) 


8) 
d 
/ dx —F(x,y+eg,y' + 9’) 
- de — 


The integration limits a and b are independent of €, and we assume that the 
order of integration and differentiation may be interchanged using Leibniz’s 


rule. Using the chain rule we have See Block 0, Chapter 2. 
d OF OF 
—f =g—+97— aad 
qe ery + GY a ee 95, +9 a7 (2.27) 
where the right-hand side is evaluated at (x,y, y’). Hence 
b 
OF OF 
asiy.a= [ax (oF +0'@55 ) (2.28) 


The second term in this integral can be rewritten using integration by parts, 
to give 


b b b 
OF OF d {OF 
[aestersy = [oap],- f t0 a (ay) wa 
But g(a) = g(b) = 0, so the boundary term on the right-hand side (that 
is, the first term) vanishes, and the rate of change of the functional S|y] 


becomes 


asi [al ree 2.90) 


If S[y] is stationary then, by definition, AS[y, g] = 0 for all functions g(z). 
It follows from the fundamental lemma of the calculus of variations (see Sec- 
tion 2.3 and equation (2.19)) that y(x) satisfies the second-order differential 


equation 
d (OF OF 
cman —e —S= ————————-—— = —= * a 


If real solutions exist, they provide stationary paths of S[y], but these are not 
necessarily maxima or minima of the functional. It is important to note that 


2.4 The Euler-Lagrange equation 


the Euler-Lagrange equation is, in most cases, a second-order, nonlinear, 
boundary-value problem, with possibly no solutions or many solutions. ‘The 
properties of nonlinear equations are quite different from those of linear 
equations. 


The Euler-Lagrange equation is a second-order differential equation. But if 
the integrand does not depend explicitly upon x, so has the form 


b 
Siyj= f aeGiy.y), ya) =A, vb) =B (2.32) 


it can be shown that the Euler-Lagrange equation reduces to the first-order 
differential equation 


yn —-Ga=e, y(a)=A, y(d) =B, a) 


where c is an arbitrary constant determined by the boundary conditions; see 
the example below. The expression on the left-hand side of this equation is 
often called the first integral of the Euler-Lagrange equation. This result 
is important because, when applicable, it saves a great deal of effort, since 
it is usually far easier to solve the lower-order equation. Two proofs of this 
result are provided in this course. The first involves deriving an algebraic 
identity and we suggest doing this yourself in Exercise 2.6. The second proof 
is given in Subsection 3.5.1 and uses the invariance properties of the inte- 
erand G(y,y'). In Chapter 4 we show that for simple mechanical systems 
equation (2.33) is sometimes equivalent to conservation of energy. A warn- 
ing, however: in some circumstances, a solution of equation (2.33) will not 
be a solution of the original Euler-Lagrange equation — see Exercise 2.5 and 
also Section 2.5. 


Another important consequence of this simplification is that the stationary 
function, the solution of (2.33), depends only upon the variables u = x — a 
and b—a (besides A and B), rather than x, a and 6 independently, as is 
the case when the integrand depends explicitly upon «x. A specific example 
illustrating this behaviour is given in Exercise 2.4. 


As an example, consider the functional 


1 
Sly] = / dx(y'’—y), y(0)=90, yl) =1. (2.34) 
In this case G(y, y’) = y/* — y and the Euler-Lagrange equation is the linear 
equation 
pf) an 2 (0) =, gp=1 (2359 
dxr2 en aN Y ee Yy i Ca 


The general solution of this equation is y = —x*/4+ Ax +B, where A and 
B are constants: the boundary condition at x = 0 gives B = 0 and that at 
z= 1 gives A=5/4. Hence y = z(5 — x)/4 is the stationary path. 


Applying the first integral, equation (2.33), to equation (2.34) gives the 
nonlinear equation 


dy \? | 
(#) ty=c, y(0)=0, y(1) =1, (2:36) 
dx 
which is separable and can be rearranged into 
d 
se (2.37) 
Cc—Yy 


so integration gives 


2Jje-—4y = LS Fs, (2.38) 
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for some constant D. The boundary condition at x = 0 gives 4c = D? and 
that at x = 1 gives D = +5/2, both giving the same solution as before. 


In this example it is easier to solve the second-order Euler-Lagrange equation 
than the first-order equation (2.33), which is nonlinear. Normally, both 
equations are nonlinear and then it is easier to solve the first-order equation. 
In the examples dealt with in Sections 2.5 and 2.7 it is more convenient to 
use the first integral. 


An observation 


You may have noticed that the original functional (2.24) was defined on 
the class of functions that have just one derivative. More precisely, it re- 
quires that F(x, y, y’) is integrable, which is a weaker condition, though this 
difference is not important for this course. However, the Euler-Lagrange 
equation (2.31) requires the stronger condition that y/"(x) is continuous. A 


theorem due to the German mathematician du Bois-Reymond shows that Emil Heinrich du 
if: Bois-Reymond, 1818-1896. 


(a) y(x) has a continuous first derivative, 

(b) 

(c) F(ax,u,v) has continuous first and second derivatives in all variables, 
(d) 0? F/dy'? £0 fora<a <b, 


then y(x) has a continuous second-order derivative and satisfies the Euler— 
Lagrange equation (2.31). (You will not be assessed on this result.) 


AS'|y, g| = 0 for all admissible g(x), 
FB 


Exercise 2.4 


Consider the functional 
b 
Sly] =| dx (y’*+y), y(a)=A, y(b) =B. 


(a) By forming and solving the Euler-Lagrange equation (2.31), show that the 
stationary path is 


B-A 
y= qu? + (FE -10-a) usa, u=2-4a. 
a 


(b) By making the change of variable u = x — a and defining Y(u) = y(x(u)) = 
y(u +a), use the chain rule to show that y’(x) = Y’(u) and that the functional 
becomes 


siv|= fav? -y?), Y(0) =A, Y(b—a)=B. 


Deduce that the stationary path depends only upon u = x — a and b — a (be- 
sides A and B). 


Exercise 2.5 


(a) Show that provided G,(y, y’) exists when y’ = 0, the differential equation (2.33) 
has a solution y(x) = 7, where the constant ¥ is defined implicitly by the equa- 
tion G(7,0) = —c. 


(b) Under what circumstances is the solution y(z) = y also a solution of the Euler- Note that this part of the 
Lagrange equation (2.31)? exercise is harder. 


2.5 Minimal surface of revolution 


Exercise 2.6 
If G(y, y’) does not depend explicitly upon x, that is OG/0x = 0, show that 


dy (# (08) _ 20) _ 4 (8G 
Vo (4 Oy! dy) dx 7 By! aid 


Hence show that equation (2.33) is true if either (a) y(a) satisfies the Euler— 
Lagrange equation, or (b) y(x) is constant. 


Exercise 2.7 


Show that the Euler-Lagrange equation for the functional 


xX 
Sly] = / dll? ae ee >, 


is y’+y=0. Hence show that, provided X ¢ nz, n = 1,2,..., the stationary 
function is y = sina/ sin X. 


Exercise 2.8 


(a) Show that the Euler-Lagrange equation for the functional 


Sly] = / dx [y’? +y?+2ay], y(0) =0, y(1) =a, 


is y’ —y =z and hence that a stationary function is 


sinh x 
sinh 1 


(b) By considering the term O(e*) in the difference Sly + €] — Sly], show also that 
this solution makes the functional a minimum. 


y(x) = (1+) 


2.5 Minimal surface of revolution 


The problem is to find the function y(z), with given end points y(a) = A 
and y(b) = B, such that the area of the surface formed by rotating the curve 
y(x) about the z-axis is a minimum. The left-hand diagram in Figure 2.2 
shows the construction of this surface. 


Figure 2.2 Diagram showing the construction of a surface of revolution (left), and 
the small segment used to construct the integral in equation (2.41) (right). 


This section is divided into three parts. First, we derive the functional S|y] 
giving the required area. Second, we derive the equation that a differentiable 
function must satisfy to make the functional stationary. Finally, we solve 
this equation in a simple case in order to show that even this relatively 
simple problem has pitfalls. 
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2.5.1 Derivation of the functional 


An expression for the area of this surface is obtained by first finding the area 
of the edge of a thin disk of width 6x, as shown in the right-hand diagram 
in Figure 2.2. The small segment may be approximated by a straight line 
provided that oz is sufficiently small, so its length 6s is given by the analysis 
in the previous chapter leading to equation (1.7) (page 9), 


= /1+y'(x)* da + O(62). (2.39) 
The area 0S traced out by this segment as it rotates about the z-axis is 


6S = 2ry(x) 6s = 2ry(x)V/1 + y'(x)2 da + O(5x) (2.40) 


Hence the area of the whole surface from x = a to x = b is 


b 
Siy| = an dxy(x)/1+y'(z)?, y(a)=A>0, y(b)=B>0. (2.41) 


Exercise 2.9 
Show that the equation of the straight line joining the points (a, A) and (6, B) in 
the left-hand diagram of Figure 2.2 is 
B-A 
ao (c—a)+A. 


Use this together with equation (2.41) to show that the surface area of the frustum 
of the cone shown in Figure 2.3 is given by 


S=n(B+A)/(b—a)? + (B— A). 


Recall that the frustum of a solid is that part of the solid that lies between two 
parallel planes which cut the solid, and its area does not include the area of the 
parallel ends. 


b—a 


Figure 2.3 Diagram showing the frustum of a cone, the shaded area. The slant 
height is /, and the radii of the circular ends are A and B. 


Show further that this expression may be written in the form 7(A + B)l where / is 
the length of the slant height and A and B are the radii of the end circles. 


The Euler-Lagrange equation 


2.5 Minimal surface of revolution 


2.5.2 Application of the Euler-Lagrange equation 


The integrand of the functional (2.41) does not depend explicitly upon z, 
hence the integrated form of the Euler-Lagrange equation (2.33) may be 
used. In this case, apart from an overall factor of 27, 


Giy,y') =yv1+y”, (2.42) 
SO 


/ 
eae aS : (2.43) 


—— = = ee | ee Es Si he) 
Oy 4/1 + y!? Oy’ \/1 + y!? 


Hence the Euler-Lagrange equation integrates to give 


a! se yl Agia (2.44) 
/1 he yy! * 
for some constant c; since y(x) > 0 we may assume that c is positive. By 
squaring and rearranging this equation we obtain the simpler first-order 
equation 


dy 


y2 — C2 a 
dr C ? y(a) — 0, y( ) a 0 ( 5) 


The solutions of equation (2.45), if they exist, ensure that the functional (2.41) 


is stationary. We shall see, however, that suitable solutions do not always 
exist and that when they do further work is necessary in order to determine 
the nature of the stationary point. 


2.5.3 The solution in a special case 


Here we solve the first-order differential equation (2.45) when the ends of the 
solid have the same radius, that is A = B > 0. Even this seemingly simple 
case has surprises in store and so provides an indication of the sort of diffi- 
culties that may be encountered with variational problems: such difficulties 
are typical of nonlinear boundary-value problems. This analysis is difficult 
and to understand it you may need to study it in depth; writing your own 
notes will probably be very useful. 


Because the ends have the same radius it is convenient to introduce a sym- 
metry by putting the ends at x = +a. This change, which is merely a 
shift along the z-axis, does not affect the differential equation (because its 
right-hand side is independent of x); the boundary conditions, however, are 
slightly different. If we denote the required solution by f(a), then it satisfies 
the differential equation and boundary conditions, 


df P—@ 
zs —“, f(-a) = f(a) =A> (2.46) 
A solution of this equation is obtained by putting c = A and f(x) = A, but 
this is not a solution of the original Euler-Lagrange equation (see Exer- 
cise 2.5). 


The structure of equation (2.46) suggests that we can use the identity 
cosh? z — 1 = sinh? z to define a new dependent variable ¢(x) by the equa- 
tion f(x) = ccosh ¢(x). Then the differential equation for ¢(x) is obtained 
by substituting this into (2.46) and using the chain rule, 


d 1 
oe sinh @ = +-\/Cc? (cosh? o- 1) = +sinh 4d, (2.47) 
C 


and hence 


© 


1 
=+-, which integratesto @=at < (2.48) 
dx C C 
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for some constant a. Therefore the general solution is f(x) = ccosh(a + 
z/c): the boundary conditions determine the values of the constants c and 
a, and these give the two equations 


A= ¢cosh (a + = | =<¢ (cosh a cosh e. + sinh a sinh = | as a= Gh. 
C C é 


A = ccosh (a x = | oe (cosh a cosh = + sinh a sinh = | at x = —a. 
C C C 


Subtracting these gives 0 = 2csinhasinh(a/c). If, however, z is real, sinh z = 
0 only if z = 0; hence, since a 4 0, we must have a = 0, giving the solution 


f(x) =ccosh(#) with c determined by A = ccosh (2). (2.49) 


The function f(x) is a solution of the Euler-Lagrange equation, but we need 
to determine the values of c such that it satisfies the boundary conditions. 


The function cosh z has a single minimum at z = 0, where its value is unity, 
and it increases monotonically for increasing |z|. Thus the constant c repre- 
sents the smallest value of f(x), which is at x = 0, as shown in Figure 2.4, 
where we plot the graph of f(a) for three values of c. 


c= 1/4 f(z) 
6 
eZ 9 
4 
c=mljz = 
he 
SS? 
=e GS oe 
_ —0.5 0 ft Be i 


IL 


Figure 2.4 Graphs of f(x) = ccosh(a/c) for |x| < 1 and c= 2, 1/2 and 1/4. Note 
that the interval |x| < 1 is not related to the position of the ends at x = =ca, 
except if a = 1. 


From these graphs we see that as c increases, f(z), becomes flatter in the 
interval shown. 


The required solutions are obtained by finding the real values of c satisfying 
equation (2.49). Unfortunately, it cannot be inverted to express c in terms 
of known functions of A. Numerical solutions may be found, but first it is 
necessary to determine those values of a and A for which real solutions exist. 


In this problem there are two lengths, a and A, that can be varied inde- 
pendently. Because there is no other length scale we expect the solution to 
depend only upon the dimensionless ratio of these two lengths. This reduces 
the number of independent parameters to one and makes the task of under- 
standing how the solutions change with a and A far easier. As an aside, 
we note that if A #4 B, there are two ratios and this makes it far harder to 
understand the general behaviour of the solutions. 


For this reason it is convenient to introduce a new dimensionless variable 
n = a/c. Our aim is to rewrite equation (2.49) in terms of 7, a and A, so we 
write the equation for c in the form 


1 
—=g(n) where g(n)= oo 1). (2.50) 


This equation shows directly that 7 depends only upon the dimensionless 
ratio A/a, that is, the ratio of the end radii to half the distance between the 
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ends. In terms of 7 and A, the solution (2.49) becomes 
cosh (= 
fiz) = - cosh (n—) z= gal: (2.51) 
n a cosh 7) 
The stationary solutions are found by solving the equation A/a = g(n) for 7. 


The graph of g(7), depicted in Figure 2.5, shows that g(7) has a single 
minimum and that for A/a > min(g) there are two real solutions. 


g(n) 
10 


wo F&F DD OW 


Figure 2.5 Graph of g(7) = (cosh 7)/n showing the solutions of the equation 
g(n) = A/a. 


This graph also suggests that g(7) — oo as 7 — 0 and ov, and this behaviour 
can be verified with the simple analysis performed in Exercise 2.11, which 
shows that 


1 e" 
g(n) ~ = forn7<1 and g(n)~ on for 7 > 1. (2.82) 


The minimum of g(7) is at the real root of ntanh7 = 1 (see Exercise 2.12). 
This may be found numerically, and is at 7,, ~ 1.2, and here g(7,,,) = 1.51. 
Hence if A < 1.5la there are no real solutions of equation (2.50), meaning 
that there are no functions with continuous derivatives making the area 
stationary. For A > 1.51la there are two real solutions both being stationary 
values of the functional (2.41); we denote these two solutions by 7, and 1p 
with 7, < 1. Because there is no upper bound on the area, neither solution 
can be a global maximum. 


Figure 2.6 shows values of the dimensionless area $/a? for these two station- 
ary solutions as functions of A/a when A/a > g(n,,) & 1.51. The area asso- 
ciated with the smaller root, 7,, is denoted by 5), and S2 denotes the area 
associated with 75. These graphs show that Sz > S; for A > ag(n,,) ~ 1.51a. 


S/a? 60 S2/a? 
50 
40 S1/a? 
30 
20 
1.5 1.75 2 Aue 2.9 2.75 3 
A/a 


Figure 2.6 Graphs showing how the dimensionless area $/a? varies with A/a, for 
A > 1.5la. 


The graphs depicted in Figure 2.6 were drawn using the expression for S 
expressed in terms of 7, Exercise 2.10, and by finding 7 for each A/a by 
solving equation (2.50) numerically for each root. 
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It is difficult to find simple approximations for the area S|f] except when 
A > a, in which case the results obtained in Exercise 2.11 may be used. We 
consider the smaller and larger roots separately. 


If A >a the smaller root, 7, is seen from Figure 2.5 to be small. ‘The ap- 
proximation developed in Exercise 2.11(a) gives 7, ~ a/A, so equation (2.51) 
becomes 


fi(x) ~ Acosh(a#/A) ~ A, (2.53) 


since |r| << a< A and cosh(x/A) ~ 1. Because f(x) is approximately con- 
stant, the original functional, equation (2.41), is easily evaluated to give 

Si A 

™. 


S, = S|fil=4raA or —=4 


= (2.54) 


The latter expression is the equation of the approximately straight line seen 
in Figure 2.6. The area 5S; is that of the cylinder formed by joining the end 
circles at x = +a with parallel lines. 


For the larger root, 7, since cosh 7 ~ e"”/2 for large n, equation (2.50) for 


becomes 
A 1 
— = —¢e! (2.50) 
c.- y 


(see Exercise 2.11(b)). So, from equation (2.51), we have 


te) =e [ow (SR) + (2) 


= Aexp (= (a — x)) + Aexp (-2(a + x)) . (2.56) 


a 
For positive x the second term is negligible (because 7, >> 1) provided 
INy >> a. For negative x the first term is negligible, for the same reason. 
Hence an approximation for fo(x) is 


fo(x) ~ Aexp (=F (a ~ |) provided |a|n. > a. or 


Some discussion of the behaviour of this function is provided after equa- 
tion (2.59). In Exercise 2.11(b) it is shown that the area is given by 


A\? 
So = S[fo] ~ 2nA? or 2 aw 3 (=) (2.58) 


which is just the area of the ends. The latter expression increases quadrati- 
cally with A/a, as seen in Figure 2.6. 


These approximations show directly that if A >> a then S2 > Sj, confirming 
the results shown in Figure 2.6. They also show that when A > a the 
smallest area is given when the surface of revolution approximates that of a 
cylinder. 


In the three parts of Figure 2.7, we show examples of these stationary solu- 
tions, computed numerically, for A = 2a, A = 10a and A = 100a. In the first 
example, on the left, the ratio A/a = 2 is only a little larger than min(g(n)), 
but the two solutions differ substantially, with f;(x), the solution associated 
with the smaller value of 7, already close to the constant value of A for 
all x. In the two other examples, the ratio A/a is larger, and now f(z) 
is indistinguishable from the constant A, while fo(x) is relatively small for 
most values of 2. 
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A= 2a A= 10a A = 100a 
1 1 1 
0.75 D.73 0.75 
0.5 0.5 OD 
0.25 falx)/A gas 0.25 
x/a x/a x/a 
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Figure 2.7 Graphs showing the stationary solutions f(a)/A = cosh(xn/a)/ cosh 7 
as a function of x/a, for various values of A. 


These examples and the preceding analysis show that when the rings are 
relatively close, i.e. A/a large, f(x) ~ A, for all x, and that as A/a — oo, 
fo(x) tends to the function 


¥ Iz| <a, 


fo(x) — fala) = A, tele (2.59) 


This result may be derived from the approximate solution given in equa- 
tion (2.57). Consider positive values of x, with rn, >a. If x = a(1 — 9), 
where 6 is a small positive number, then 


fo(x) ~ Ae~™., (2.60) 


However, from equation (2.55), In(A/a) = 7 — In(2n); if 7 > 1, In(2n) < », 
and 7 ~ In(A/a), the above approximation for fo(x) thus becomes 


fi) S @¥ o= ea =a). (2.61) 


Hence, provided 6 > 0, that is, x 4a, fo(x)/A — 0 as A/a — ov. 


The surface defined by this limiting function comprises two disks of radius A, 
a distance 2a apart, so has area Sg = 277A”, independent of a. Since this lim- 
iting solution has discontinuous derivatives at x = +a, it is not an admissible 
function, though there are admissible functions that are arbitrarily close to 
it. Nevertheless, it is important because if A < ag(n,,) ~ 1.5la, it can be 
shown that this surface gives the global minimum of the area and, as will be 
seen in the next subsection and in Section 2.6, this has physical significance. 
This solution to the problem was first found by B. C. W. Goldschmidt in 
1831, and is hence known as the Goldschmidt curve or Goldschmidt solution. 


You will not be expected 
to reproduce this type of 
argument in this course. 
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2.5.4 Summary of Section 2.5 


We have considered the special case where the ends of the solid generated by 

our surface of revolution are at x = -ka and each end has the same radius A. 

In this case the curve y = f(x) is symmetric about 2 = 0 and we have 

obtained the following results. 

1. Ifthe radius of the ends is small by comparison to the distance between 
them, A < ag(7,,,) ~ 1.5la, there are no curves described by differen- 
tiable functions making the traced out area stationary. In this case it 
can be shown that the smallest area is given by the Goldschmidt solution, 
fa(x), defined in equation (2.59), and that this is the global minimum. 

2. If A> 1.5la, there are two smooth stationary curves. One of these 
approaches the Goldschmidt solution as A/a — oo, and the other ap- 
proaches the constant function f(x) — A in this limit. The latter curve 
gives the smaller area. 


The nature of the stationary solutions is not easy to determine. In Figure 2.8 
we show the areas 5) /a? and S2/a’, as in Figure 2.6, and also the area given 
by the Goldschmidt solution, Sg/a? = 21(A/a)*, curve G, and the area of 
the cylinder, S,/a* = 47A/a, curve c. 


S/a? 60 
50 


1.5 1.75 2 2.25 2.5 2.49 3 
A/a 


Figure 2.8 Graphs showing how the dimensionless area S/a? varies with A/a. 
Here the curves S;,/a? denote the areas as in Figure 2.6, k = 1,2; G is the scaled 
area of the Goldschmidt curve, Sg/a? = 21(A/a)?; and c is the scaled area of the 
cylinder, 47A/a. 


If A > ag(7,,) ~ 1.51a, it can be shown that 5S; is a local minimum of 
the functional, but the graphs in Figure 2.8 show that for A < 1.895a the 
Goldschmidt curve yields a smaller area. A physical interpretation of these 
solutions is given in the next section. 


This relatively simple example of a variational problem provides some idea 
of the possible complications that can arise. 


Exercise 2.10 


b 
If f(a) = ccosh(a/c), show that the functional S[y] = am f dx y\/1+y" has the 


value 


S 27 4 
i = ~ (n+ sinh7cosh 7) , n= -. 
a n ‘ 
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Exercise 2.1 I 


(a) Use the expansion cosh = 1 + $7? + O(n") to show that, for small 7, g(7) = 
1/n + n/2+ O(n), where g(n) is defined in equation (2.50). Hence show that 
if A > a, then n ~ a/A, and hence that c ~ A and f(x) ~ A. Using the result 
obtained in the previous exercise, or otherwise, show that S; = 47Aa. 

(b) Show that if 7, is large, the equation defining it is given approximately by 

ee 
Qa 2Nn 


and, using the result obtained in the previous exercise, that 


Exercise 2.12 


(a) Show that the position of the minimum of the function g(7) = (cosh n)/n, 7 > 0, 
is at the real root, 7,,, of ntanhn = 1. 


By sketching the graphs of y = 1/n and y = tanh», for 7 > 0, show that the 
equation 7 tanh 7 = 1 has only one real root. 


(b) If a/c=7n,, and A/a = g(n,,), use the result derived in Exercise 2.10 to show 
that the area is S,, = 27A?n,,. 
Exercise 2.13 


Show that a solution of equation (2.46) is obtained by putting c = A and f(x) = A. 
Show also that this function is not a solution of the associated second-order Euler— 
Lagrange equation. 


Exercise 2.14 


(a) Show that the functional 


Sly] = / de Vy F¥), (1) =u) =4>0 


is stationary on the two paths 
1 
y(z) = — (4c +27) wherec? =cL =$(AtVA?-1). 
Ac? = 2 


In the following, these solutions are denoted by y;(x) and y_(), respectively. 


(b) Show that on these stationary paths 
1 
a 2 —= 
Sly] = 2+ eG, 
and deduce that when A> 1, S[y_] > S[y4], and that when A= 1, S{y] = 
4,/2/3. Show also that if A> 1 


Sle. | oe 4 As/? and Sly,] ~ 2VA. 
(c) Find the value of S|y] for the function 
0, O<4<21-6,0< 6< 1, 
yo(x) = A 


with ys(—2) = ys(x). Show that as 6-0, and «> 0, ys(x) — fa(x), the 
Goldschmidt curve defined in equation (2.59). Show also that 


lim $ [ys] = S[fa] = $4%?. 


This exercise is more 
difficult than average. 


This a long, fairly hard 
exercise that you might 
consider doing when 
revising this chapter. 
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2.6 Soap films 


An easy way of forming soap films is to dip a loop of wire into soap solution 
and then to blow on it. Almost everyone will have noticed that the initial 
flat soap film bounded by the wire forms a segment of a sphere when blown. 
It transpires that there is a very close connection between these surfaces and 
problems in the calculus of variations. ‘The exact physics of soap films is 
complicated, but a fairly simple and accurate approximation shows that the 
shapes assumed by a soap film are such as to minimise their areas, because 
the free-energy is approximately proportional to the area and equilibrium 
positions are given by the minimum of this energy. ‘Thus, in some circum- 
stances the shapes given by the minimum surface of revolution, described in 
Section 2.5, are those assumed by soap films. 


The study of the formation and shapes of soap films has a very distinguished 
pedigree: Newton, Young, Laplace, Euler, Gauss, Poisson are some of the 
eminent scientists and mathematicians who have studied the subject. Here 
we cannot do the subject justice, but the interested reader should try to 
obtain a copy of Isenberg’s The science of soap films and soap bubbles. The 
essential property is that a stable soap film is formed in the shape of a 
surface of minimal area that is consistent with its wire boundary. 


Probably the simplest example is that of a soap film supported by a circular 
loop of wire. If we distort it by blowing on it gently to form a portion the 
approximate shape of a sphere, when we stop blowing the surface returns 
to its previous shape, that is a circular disc. Essentially this is because in 
each case the free-energy, which is proportional to the area, is smallest in 
the assumed configuration. 


Imagine a framework comprising two equal circles of radius A, held a dis- 
tance 2a apart (like wheels on an axle). What shape soap film can such 
a frame support? Figure 2.9 illustrates the alternatives suggested by the 
analysis of the previous section and agree qualitatively with the solutions 
one would intuitively expect. 
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Figure 2.9 Diagrams showing two configurations assumed by soap films on two 
rings of radius A, a distance 2a apart. On the left, A < 1.89a, the soap film 
simply fills the two circular wires because they are too far apart: this is the 
Goldschmidt solution, equation (2.59). On the right, A > 1.5la, the soap film 
joins the two rings in the shape defined by equation (2.51) with 7 = 7. 


The left-hand configuration, with two distinct surfaces, is the Goldschmidt 
solution, equation (2.59), and it gives an absolute minimum area if A < 
1.89a. The shape on the right is a catenoid of revolution and represents the 
absolute minimum if A > 1.89a; it is a local minimum if 1.5la < A < 1.89a 
and does not exist if A < 1.5la. When 1.5la < A < 1.89a the catenoid is 
unstable and we have only to disturb it slightly, by blowing on it for instance, 
and it may suddenly jump to the Goldschmidt solution which has a smaller 
area, aS seen in Figure 2.8. 


C. Isenberg, The science of 
soap films and soap bubbles 
(Dover, 1992). 


2.6 Soap films 


The methods discussed previously provide the shape of the right-hand film, 
but the matter of determining whether these stationary positions are ex- 
trema, local or global, is of a different order of difficulty and beyond the 
scope of this course. The complexity of this physical problem is further 
compounded when one realises that there can be minimum energy solutions 
of a quite unexpected form. Figure 2.10 illustrates a possible configuration 
of this kind. We do not expect the theory described in the previous section 
to find such a solution because the mathematical formulation of the physical 
problem makes no allowance for this type of behaviour. 


Figure 2.10 Diagram showing a possible soap film. In this example a circular film, 
perpendicular to the axis, is formed in the centre and this is joined to both outer 
rings by a catenoid. 


The relationship between soap films and some problems in the calculus of 
variations can certainly add to our intuitive understanding, but this example 
should provide a salutary warning against dependence on intuition. 


As an example of the complex shapes that soap bubbles can form, consider a 
cubical frame of wire. When dipped into a soap solution then taken out, films 
of local minimum free-energy, that is minimum area, will form on this frame: 
it transpires that the possible shapes formed are many, varied and often 
counter-intuitive. Some of the possible shapes are shown in Figures 2.11— 
243. 


Figure 2.1 | Figure 2.12 Figure 2.13 


The above examples illustrate the fact that there may be more than one 
area of minimal surface with the same boundary conditions and shows that 
some physical problems that are simple to state can have bizarre solutions 
which are difficult to describe mathematically. 
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2.7 The brachistochrone 


The problem, described in Subsection 1.5.1 (page 21), is to find the smooth 
curve joining two given points P, and P,, lying in a vertical plane, such that a 
bead sliding on the curve, without friction but under the influence of gravity, 
travels from P, to P, in the shortest possible time, the initial speed at P, 
being given. It was pointed out in Subsection 1.5.1 that Johann Bernoulli 
made this problem famous in 1696 and that several solutions were published 
in 1697: Newton’s comprised the simple statement that the solution was a 
cycloid, giving no proof. There are several variants of this problem that may 
be treated using the methods described here; these include the addition of 
speed dependent resisting forces and constraints placed on the position of 
one or both end points. Here we consider the simplest problem: we shall 
prove algebraically that the stationary path is a cycloid, so first we need 
to define the cycloid. Even if you are familiar with this curve, you should 
find the following subsection of interest besides being a reminder of the 
mathematical description of the cycloid. 


2.7.1 The cycloid 


The cycloid is one of a class of curves formed by a point fixed on a circle 
that rolls, without slipping, on another curve. A cycloid is formed when 
the fixed point is on the circumference of the circle and the circle rolls on a 
straight line, as shown in Figure 2.14. 
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Figure 2.14 Diagram showing how the cycloid OPD is traced out by a point on 
the circumference of a circle rolling along a straight line. 


In this figure a circle of radius a rolls along the z-axis, starting with its 
centre on the y-axis. Fix attention on the point P attached to the circle, 
initially at the origin O. As the circle rolls, P traces out the curve OPD 
called the cycloid. 


The cycloid has been studied by many mathematicians from the time of 
Galileo (1564-1642), and was the cause of so many controversies and quar- 
rels in the 17th century that it became known as ‘the Helen of geome- 
ters’. Galileo named the cycloid but knew insufficient mathematics to make 
progress. He tried to find the area between it and the z-axis, but the best 
he could do was to trace the curve on paper, cut out the arc and weigh 
it, to conclude that its area was a little less than three times that of the 
generating circle — in fact it is exactly three times the area of this circle, as 
you can show in Exercise 2.15. Galileo abandoned his study of the cycloid, 
suggesting only that the cycloid would make an attractive arch for a bridge. 
This suggestion was implemented in 1764 with the building of a bridge with 
three cycloidal arches over the river Cam in the grounds of Trinity College, 
Cambridge. 


2.7 The brachistochrone 


Figure 2.15 Essex’s bridge over the Cam, in the grounds of Trinity College, with 
its three cycloidal arches. 


The reason why cycloidal arches were used is no longer known, all records 
and original drawings having been lost. However, it seems likely that the 
architect, Essex, chose this shape to impress Smith, the Master of Trinity 
College, who was keen to promote the study of applied mathematics. 


The area under a cycloid was first found by Roberval in 1634. In 1638 he also 
found the tangent to the curve at any point, a problem solved at about the 
same time by Fermat and Descartes. Indeed, it was at this time that Fermat 
gave the modern definition of a tangent to a curve. Later, in 1658, Wren, 
the architect of St. Paul’s Cathedral, determined the length of a cycloid. 


Pascal’s last mathematical work, in 1658, was on the cycloid and, having 
found certain areas, volumes and centres of gravity associated with the cy- 
cloid, he proposed a number of such questions to the mathematicians of 
his day with a first and second prize for their solution. However, publicity 
and timing were so poor that only two solutions were submitted and be- 
cause these contained errors no prizes were awarded, which caused a degree 
of aggravation among the two contenders Antoine de Lalouvére and John 
Wallis. 


At about the time of this contest Huygens designed the first pendulum clock, 
which was made by Salomon Closter in 1658, but was aware that the period 
of the pendulum depended upon the amplitude of the swing; a short discus- 
sion of this problem is given in Subsection 4.5.1. It occurred to Huygens to 
consider the motion of an object sliding on an inverted cycloidal arch and 
he found that the object reaches the lowest point in a time independent of 
the starting point. The question that remained was how to persuade a pen- 
dulum to oscillate in a cycloidal, rather than a circular arc. Huygens then 
made the remarkable discovery illustrated in Figure 2.16. If one suspends 
from a point P at the cusp, between two inverted cycloidal arcs PQ and 
PR, then a pendulum of the same length as one of the semi-arcs will swing 
in an cycloidal arc (SR which has the same size and shape as the cycloidal 
arcs of which PQ and PR are parts. Such a pendulum will have a period 
independent of the amplitude of the swing. 
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Huygens made a pendulum clock with cycloidal jaws, but found that in prac- 
tice it was no more accurate than an ordinary pendulum clock: his results 
on the cycloid were published in 1673 when his Horologium oscillatorium 
appeared. However, the discovery illustrated in Figure 2.16 was significant A more detailed account of 


in the development of the mathematical understanding of curves in space. | Huygens’ work is given in 
Unrolling time by J. G. 


Yoder (Cambridge 
P University Press, 1988). 
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Figure 2.16 Diagram showing how Huygens’ cycloidal pendulum, PT’, swings 
between two fixed, similar cycloidal arcs PR and PQ. 


The equation of the cycloid is obtained by finding the coordinates of P, in 
Figure 2.14, after the circle has rolled through an angle 6, so the length of the 
longer circular arc PA is a@. Because there is no slipping, OA = PA = a, 
and the coordinates of the circle centre are C = (a0,a). The lengths PB 
and BC are PB = —acos@ and BC = —asin@ and hence the coordinates 
of P are 


¢=a(@—sin0), y=a(1—cos8), (2.62) 


which are the parametric equations of the cycloid. For |@| < 1, x and y are 
related, approximately, by y = (a/2)(6a/a)?/? (see Exercise 2.17). 


If we assume that the y-axis in Figure 2.16 is in the direction PS, that is 
pointing downwards, the upper arc QPR, with the cusp at P is given by 
these equations with —7 < 6 < 7 and it can be shown that the lower arc is 
described by « = a(O + sin8), y = a(3 + cos@), and the same range of 0. 


Exercise 2.15 

Show that the area under the arc OPD in Figure 2.14 is 37a” and that the length 
of the cycloidal arc OP is s(9) = 8asin? (0/4). 

Exercise 2.16 


Show that the gradient of the cycloid is given by 
ae i - 1 
dx  tan(0/2) 


Deduce that the cycloid intersects the x-axis perpendicularly when 6 = 0 and 2r. 


Exercise 2.17 


By using the Taylor series of sin@ and cos@, show that for small |6|, 7 ~ ab? /6 
and y ~ a6?/2. By eliminating 6 from these equations, show that near the origin 
y & (a/2)(6x/a)?/3. 
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2.7 The brachistochrone 


2.7.2 Formulation of the problem 


In this section we formulate the variational principle for the brachistochrone 
problem by obtaining an expression for the time of passage from given points 
P, = (a, A) to P, = (6, B) along a curve y(x), using energy conservation. 


Define a coordinate system Oxy with the y-axis, oriented vertically upwards, 
as the starting point and the final point lying on the horizontal x-axis, so 
e= 8 =, as in Figure 2.17. 


e=f b 2 


Figure 2.17 Diagram showing the curve y(x) through P, = (0, A) and P, = (6,0) 
on which the bead slides. Here s(x) is the distance along the curve from P, to a 
point P = (x, y(x)) on it. 


At a point P = (z,y(x)) on this curve, let s(x) be the distance along the 
curve from P, to P, so the speed of the bead is defined to be v = ds/dt. The 
kinetic energy of a bead having mass m at P is smu" and its potential energy 
is mgy; because the bead is sliding without friction, energy conservation 


gives 


smvu> +mgy = E (2.63) 
where the value of the energy FE is given by the initial conditions, 
E= 4m +mgA, (2.64) 


where vp is the speed at P, = (0, A). From Figure 1.5 (page 9), small changes 
in s are given by 6s? = 6x? + dy’, and so 


ae\* dx \* dy : dx \* 
a vee, eon ea es pe taf (r*) . 2.65 
(Z) @ + (2) @ eee) ee) 
Thus on rearranging equation (2.63) we obtain 
ds\*  2E ne 
Now the time of passage from x = 0 to x = b is given by the integral 


i b 1 
oe ae a ee 2.67 
/ / ” da] dt sel) 


Thus on rearranging equation (2.66) to express dx/dt in terms of the function 
y(x), we obtain the required functional 


(2.68) 


This functional may be put in a slightly more convenient form by noting 
that the energy and the initial conditions are related by equation (2.64), so 
by defining the new dependent variable 


z(z4) =A+— —-y(z2), (2.69) 
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we obtain 
b 
1+ 2'(x)? 
= dx: 4 | ————-at 2.70 
| \ 29z(x) — 


Exercise 2.18 


Use equation (2.68) to show that the time taken for a particle of mass m to slide 
down the curve y = Ax from the point (X, AX), where it is stationary, to the origin 
1s 

Le 


ii se ¢ , 
2gA 


Show also that if the point (X, AX) lies on the circle of radius R and with centre at 
(0, R), so the equation of the circle is x? + (y — R)*? = R?, as shown in Figure 2.18, 
then the time taken to slide along the straight line from (X, AX) to the origin, O, 
is independent of X and is given by 


en a 
g 


This result was known by Galileo and was one reason why he thought that the 
solution to the brachistochrone problem was a circle. 


2.7.3 A solution 


The integrand of the functional (2.70) is independent of x, so we may use 
equation (2.33) (page 49) to write Euler’s equation in the form 


F 1 12 
z eae F =constant, where F(z,z') = ste ; (2.0) 
bra z 
and z(0) = vé/2g, z(b) = A+ z(0). Since 
/ 

Si (2.72) 

Oz! z{l 2 oe 
this gives 

12 1 iz 1 
‘ 2 (2.73) 


/2(1 4+ 2'2) z (gn ove 
for some positive constant c. Note that c must be positive because the left- 
hand side of equation (2.73) is negative. Rearranging the last expression 
gives ' 


z(1+ — =) ee a — —1. (2.74) 


This first-order differential equation is separable and can be solved. First, 
however, note that because the y-axis is vertically upwards, we expect the 
solution y(a) to decrease away from x = 0, that is, z(a) will increase so we 
take the positive sign in equation (2.74) and integration gives 


= Jes mer (2.75) 


Now substitute z = c’ sin’ ¢ to give 


os 22 | do sin? 6 = c? [ew (1 — cos 29). (2.76) 


(X, AX) 


é, L 


Figure 2.18 


Note that we ignore the 
factor (29)~1/?. 
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Integration then gives the solution in the parametric form 
2 = 50° (2¢ —sin2¢)+d and z=c’sin?¢= 5c°(1 —cos2@), (2.77) 


where d is a constant. Both c and d are determined by the values of A, 6} 
and the initial speed, vg. Comparing these equations with the parametric 
equations (2.62), we see that the required stationary curve is a cycloid. It 
may also be shown that this solution is the global minimum; a simple method 
of doing this is explored in Exercise 2.36. 


In the case that the particle starts from rest, v9 = 0, these solutions give 
g=d+ BC (26 — sin 26) 9. 44> 5c (1 — cos 2¢) , (2.78) 


where c and d are constants determined by the known end points of the 
curve. 


At the starting point y = A so here ¢ = 0; and since x = 0 it follows that 
d=0. This means that initially the particle falls vertically downwards. At 
the final point of the curve, x = b, y = 0, let @ = op. Then 


2b 2A 
— = 2¢p —sin2¢,, —> = 1— cos 2¢y, (2.79) 
C C 


giving two equations for c and @p. 


We now show that these equations have a solution and that it is unique. 
Consider the cycloid 


“u=20—sn20, v=i-—cos2?, 0<@< =. (2.80) 


The value of dy is given by the value of 6 where this cycloid intersects 
the straight line Au = bv. The graphs of these two curves are shown in 
Figure 2.19. 


Figure 2.19 Graphs of the cycloid defined in equation (2.80) and the straight line 
Au = bv. 


Because the gradient of the cycloid at 6 = 0 (u = v = 0) is infinite this graph 
shows that there is a single value of @» for all positive values of the ratio 
A/b. By dividing the second of equations (2.80) by the first we see that this 
root is given by solving the equation 
20—sin20 0b 
a ae O28 = @, (2.81) 
This equation is most conveniently solved numerically; however, if b/A is 
small a satisfactory approximation can be found. Once @¢» is known, the 
value of c is given from the equation 2A/c* = 1 — cos 2¢,, which may be put 
in the more convenient form c? = A/ sin? dp. 
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Exercise 2.19 
This problem is slightly 


Use the solution defined in equation (2.77) to show that on the stationary path the },,der than average. 


time of passage is 


_ [2A 
so acs -g sind, 


We end this section by showing a few graphs of these solutions and quoting 
some formulae that may help you to understand them. The analysis that 
follows is not an assessed part of the course, and you are not expected to be 
able to derive the quoted formulae. 


Figure 2.20 shows graphs of the stationary paths for A = 1 and various 
values of 6, ranging from small to large, so all curves start and end at the 
points (0,1) and (0,0), respectively, with 0.1 <b < 4. 


0.5 


=e 


=| 


Figure 2.20 Graphs showing the stationary paths for joining the points (0,1) and 
(6,0) ior 6 = 0.1, 1/2,.3, 7/2, 2, 3nd 4. 


From this figure we see that for small 6 the stationary path is close to that 
of a straight line, as would be expected. In this case ¢, is small and it can 
be shown that 
3b. 
= 94 ~ 2048 
Also, the time of passage is 


9A 3b 814 
T=,/—(1 as oe Se rh t ; 
; ( + op spat + 008) (2.83) 


By comparison, if a particle slides down the straight line joining (0, A) to 
(b,0), that is y/A+2/b =1, so z = Ax/b, then the time of passage is 


+O(b°) and A= 1 a ; (2.82) 


2A b? 
a— | ihe ee, 4. be A, 
2(A2 + b?) =| 2A? w) 
ist = ——— = : 4” (2.84) 
—{l4+w—, ie 
b za ( + 53 + Ol} )). b>A 
Thus, for small 6, the relative difference is 
b2 
ft; -T=7- _ 27, (2.85) 


8A? 
Returning to Figure 2.20 we see for small 6 the stationary paths cross the z- 
axis at the terminal point. At some critical value of 6 the stationary path is 
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tangential to the x-axis at the terminal point. We can see from the equation 
for x(@) that this critical path occurs when y’(¢) = 0, that is when ¢, = 7/2 
and, from equation (2.81), we see that this gives b = Az/2. On this path 
the time of passage is 


2A 4 
T=—,/— andalso Tez =7T4/1+— =1.188T. (2.86) 
zu T 


For 6 > Am/2 the stationary path dips below the x-axis and approaches 
the terminal point from below. For b >> Az/2 it can be shown that ¢p = 
tT —/Ar/b+ of? 2) and that the path is given approximately by 


b b 
u~ 5, (26 =—sn Io), go A— = sin? ¢, (2.87) 


and that 


r= (1 f2+ 4 (2) +). (2.88) 


Thus the time of passage increases with /b, compared to the time to slide 
down the straight line, which is proportional to b, for large 6. Further, the 
stationary path reaches its lowest point when ¢ = 7/2, where y = A — b/7, 
in other words it falls about 1/3 as far below the x-axis as it travel along 
it, provided b >> Az. That is, the particle first accelerates to a high speed, 
reaching a speed v ~ \/2gb/m, before slowing to reach the terminal point at 
speed v = \/2gA: on the straight line path the particle accelerates uniformly 
to this speed. 


2.8 Further Exercises 


Exercise 2.20 


Show that the Euler-Lagrange equation for the functional 


Sly] = / dz (y'’*—y* —2ay), y(0)=y(1) =9, 


is y’ +y = —z. Hence show that the stationary function is 
sin x 
ge) = —2x 
y(z) sin 1 


Exercise 2.21 


Consider the functional 
b 
Sly] =| dz (y'*+y*), y(a)=A, y(b) =B. 


(a) By forming and solving the Euler-Lagrange equation, show that the stationary 
path is 
B — Acosh(b — a) 


= Acosh 
- casual sinh(b — a) 


sinhu, uwu=2x-—<a. 


(b) By making the change of variable u = x — a and defining Y(u) = y(z(u)) = 
y(u +a), show that y’(a) = Y’(u) and that the functional becomes 


SY] =f a (Y'*+Y¥*), YO)=A, Y(6—a) +s 


Deduce that the stationary path depends only upon u = x — a and b — a (be- 
sides A and B). 
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Exercise 2.22 


Euler’s original method for finding solutions of variational problems is described 
in equation (2.5) (page 42). Consider approximating the functional defined in 
Exercise 2.20 using the polygon passing through the points (0,0), ($, y1) and (1,0), 
so there is one variable y,; and two segments. 


This polygon can be defined by the straight-line segments 


2yi2, ass 
y(x) = . 
2yi(1—2), =<a<1. 


By substituting this function into the functional, show that the corresponding poly- 
gon approximation to the functional becomes 


S(m) = +yf -— 3m 


and hence that the stationary polygon is given by y(1/2) = y; = 3/44. Note that 
this gives y(1/2) ~ 0.0682 by comparison to the exact value 0.0697. 


Exercise 2.23 


Find the functions that make the following functionals stationary. 


(a) Sly! = [ a (y'? + 12ry), y(0)=0, y(1) =2. 

(b) stl = fae, y(1) =A, y'(2) = B. 

(c) Sly] = [ a (2y?y/? —(1+a)y"), y(0)=1, y) =2. 
(d) Sly] = [es y’(0) =1, y(a) = A2, A> V2a. 


Exercise 2.24 


Find the general solution of the Euler-Lagrange equation corresponding to the 
functional 


b 
3h = / dedi @, 


and find explicit solutions in the special cases w(x) = \/x and w(x) = a. 


Exercise 2.25 


What is the equivalent of the fundamental lemma of the calculus of variations in the 
theory of stationary points of functions f(21,%2,...,%,) of several real variables? 


Exercise 2.26 
Consider the functional 
1 
Sly] = / dz (y'?-1)", y(0)=0, y)=A>0. 
0 
(a) Show that the Euler-Lagrange equation reduces to y’? = m*, where m is a 
constant. 


(b) Show that the equation y’? = m*, with m > 0, has the following solutions that 
fit the boundary conditions, y;(“2) = Az: 


A 
nz, 0) <= ee a 
yo(x) = gee m>A 
A+m(1— 2), < ee, 


The Euler-Lagrange equation 


2.8 Further Exercises 


and 
m—A 
—mM2x, <2 = 5 : 
wis) = ae aed m>A 
A-—m(1—32), ee 1. 
2m 


Show also that on these solutions the functional has the values 
S[y1] = (A? -—1)? and  S[y2] = S[y3] = (m* — 1)?. 


(c) Deduce that if A > 1, the minimum value of S{y] is (A? — 1)?, and that this 
occurs on the curve y;(x), but if A < 1, the minimum value of Sy] is zero, and 
this occurs on the curves y2(x) and y3(a) with m = 1. 


Exercise 2.27 


Show that the following functionals do not have stationary values, where, in all 
cases, y(0) = 0 and y(1) = 1. 


(a) / . dxy' ——_(b) / . dxyy’ — (c) / . dx xyy! 


Exercise 2.28 


Show that the Euler-Lagrange equations for the functionals 


situ = f deFeyy) and Saly] = [ae (Fle.uu!) + £60) 


are identical. 


Exercise 2.29 


Show that the functional 


Sly] =f acy? (Qe—y'), y(—-1) =0, y(1) =1, 


achieves its minimum value when 


ae . -i<2£< 6, 
ae al B22 ot 


a function for which the second derivative does not exist at x = 0. Show that, 
despite the fact that y’’(a) does not exist everywhere, the Euler-Lagrange equation 
is satisfied. 


Note: It can be shown that if y(x) is a solution of the Euler-Lagrange equation for 
the functional with the integrand F(x, y, y’), and if F has continuous first and sec- 
ond derivatives with respect to all its arguments, then y(x) has a continuous second 
derivative at all points where 0?F'/dy'? 4 0. In the present case, 07 F/Oy!? = —2y? 
(see page 50). 
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2.9 Harder Exercises 


Exercise 2.30 


Use the approximation (2.5) (page 42) to show that the equations for the values of 


y = (y1,Y2,---, yn), Where yx41 = yr +h, that make Sly] stationary are 
Os 0 0 0 
— = h—F n — fr n ——F n == tj, a oes Be 
a (Zn) + azn) — aE (zn41) n 


where 
ln. = (Ey, 8%), CH By  8= 
and yo = A and ynii = B. 
Show also that 
Znt1 = In +h(L,ynsYn) + O(R*), 
and hence that 
Os =1(> - O° F aoa OF eres 
OYn Ou. - Ovdvu “~"“Oudv ~" Ov" 
= —h (= (=) ss >) O(h?), 


where F' and its derivatives are evaluated at z = z,. Hence derive the Euler— 
Lagrange equations. 


) +00 


Exercise 2.31 


This exercise is a continuation of Exercise 2.22 and uses a set of N variables to 
define the polygon. Take a set of N + 2 equally-spaced points on the x-axis, x, = 
k/(N +1), k=0,1,...,N+1 with x =0 and ry4; = 1, and a polygon passing 
through the points (xz, yx). Since y(0) = y(1) = 0 we have yo = yn+1 = 0, leaving 
N unknown variables. 


Show that the functional defined in Exercise 2.20 approximates to 


N 
i (Ye+1 — Yk)? 2 2k eee 
s=>| h Ye ye | | Siew 


(a) For N = 1, the case treated in Exercise 2.22, show that this reduces to 


S(y) = Lyf ail SY. 


Explain the difference between this and the previous expression for S(y;), given 
in Exercise 2.22. 


(b) For N = 2 show that this becomes 
S = fyi + Sys — Gy ye — 2y1 — Sy, 
and hence that the equations for y; and yg are 


34y, — 18y2 = §, 34yo — 18y1 = §. 
Solve these equations to show that y(1/3) ~ 35/624 ~ 0.0561 and y(2/3) ~ 
43/624 ~ 0.0689. Note that these compare favourably with the exact val- 
ues, y(1/3) = 0.0555 and y(2/3) = 0.0682 obtained from the solution of Exer- 


cise 2.22. 
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2.9 Harder Exercises 


Exercise 2.32 


Consider the functional 


b 
Sly] = / dx F(y"(x)), 


where F(z) is a differentiable function and the admissible functions are at least 
twice differentiable and satisfy the boundary conditions 


y(a)= Ai, y(b)=Bi, y'(a)=A2, y'(b) = Ba. 


(a) Show that the function making S|y] stationary satisfies the equation 


OF 

ay! = c(x — a) +d, 
where c and d are constants. [Hint: Integrate the Gateaux differential by parts 
twice. | 


(b) In the case that F(z) = $27, show that the solution is 


y(x) = ge(z — a)* + 4d(a — a)? + Ao(w — a) + Ai, 
where c and d aa the equations 

-cD° + idD* = — A, —AoD, where D=b-da, 

$cD? + dD = By - a 


(c) Show that this stationary function is also a minimum of the functional. 


Exercise 2.33 


The theory described in the text considered functionals with integrands depending 
only upon z, y(x) and y'(x). However, functionals depending upon higher deriva- 
tives also exist and are important, for example, in the theory of stiff beams; the 
equivalent Euler-Lagrange equation may be derived using a direct extension of the 
methods described in this chapter. 


Consider the functional 
b 
Sty] | dz F(x,y,y',y"), y(a) = A1, y'(a) = Ao, y(b) = Bi, y'(b) = Bo. 


Show that the Gateaux derivative of this functional is 


b 
OF OF . , oF 
Asly.9] = | dx (o> +9 I oy +9 am) 


Using integration by parts show that 
b b 2 
OF d OF 
[easy [9a (ap): 


being careful to describe the necessary properties of g(a). Hence show that Sy] is 
stationary for the functions that satisfy the fourth-order differential equation 


@ (ar) 4 (oF), aF _y 
dx? \ Oy" dx \ Oy! Oy 


with the boundary conditions y(a) = Aj, y’(a) = Ae, y(b) = Bi, y’(b) = Bo. 


Exercise 2.34 


Using the result derived in the previous exercise, find the stationary functions of 
the following functionals. 


(a) Sly] =| dr(it+y"*), y@)=0) y/O)= 1,90) = 9) =4 
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Exercise 2.35 


(a) Show that the Euler-Lagrange equation for the functional 


b 
Sly] =| dx [y'(x)? — y(x)?],  y(a) = A, y(b) = B, 


a9 ty=90, yla)=A, y(b) =B. 


(b) Second-order equations of the above form occur frequently, but the boundary 
conditions are sometimes different, involving linear combinations of y and y/’. 
Thus a typical equation is 


d*y 
gee 


where h, and hp are constants. 


+y=0, hay(a)t+y'(a)=0, hoy(b) + y'(b) = 9, (2.89) 


Show, from first principles, that the functional 


b 
Sly] = hey(b)? — hay(a)* + / dz [y'(x)* — y(x)”] 
is stationary on the path that satisfies equation (2.89). 


Exercise 2.36 


In this exercise you will show that the cycloid is a minimum for the brachistochrone 
problem. The proof is in two stages. 


(a) Show that the ae ee T(z], defined in equation (2.70) (page 66) with the 
boundary conditions z(0) = A > 0, z(b) = 0, can be re-expressed in the form 


aja fae —— 


if the roles of x and z are eae that is, if z is taken to be the indepen- 
dent variable and x the dependent variable. 


(b) Consider the varied path Pie + €g(z), and show that 


i ee bie + O(e*) 


a Ge do cos* ¢ g'(z)*, 
0 


where z = c? sin? ¢ and x = $c?(2¢ — sin2¢). Deduce that T[x + eg] > T [2], 
for « £0 and all g(x), and hence that the stationary path is actually a mini- 
mum. 


Exercise 2.37 


In this exercise you will examine a simple example that emphasises the need to be 
very careful with the choice of admissible functions. 


Consider the functional 


suj= | deV1+y'?, y(0) =y(1) =9, 


for the distance between the two points (0,0) and (1,0) on the z-axis. If the 
admissible functions are everywhere differentiable then we saw in Chapter 1 that 
the minimum distance is given by the straight line joining the end points. Here you 
will show that we can construct a continuous function arbitrarily close to this path 
with a length larger than any given number. 


Consider an isosceles triangle ABC, where A and C are on the x-axis at x = 0 and 
1, respectively, as shown in the left-hand diagram in Figure 2.21, with height h, 
and AB of length J. 


The Euler-Lagrange equation 


Here we ignore the 
external factor 1/,/2g. 


2.9 Harder Exercises 


Figure 2.21 


(a) Construct the two smaller triangles AB; D and DB2C by halving the height 
and width of the triangle ABC, as shown in the right-hand diagram of F'ig- 
ure 2.21. Show that AB, = 1/2 and h; = h/2. Hence show that the lengths of 
the lines AB; DB2C and ABC are the same and equal to 21. 


(b) Show that after n such divisions there are 2” similar triangles of height 2~"h, 
and that the total length of the curve is 2/. Deduce that arbitrarily close to 
AC, the shortest distance between A and C’, we may find a continuous curve 
every point of which is arbitrarily close to AC’, but which has any given length. 
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Solutions to Exercises in Chapter 2 


Solution 2.1 


The first result follows directly from equation (2.5) because F’ is independent of 
x and y, y(a) = yo = A and y(b) = yn41 = B. The variable y, for each k = 
1,2,...,.N appears in only two terms of the sum, so 


OS O Yk — Yk-1 Yk+1 — Yk 
a Pen © Ue ee ek | oh Be 
OYK a2 | ( h - ( h 


and hence, since F' depends only upon y’ and not y, the stationary points are given 
by the equations 


OF pt ( We a ee Getcha hs I 
O"Yk h h 

Thus F’((yx — ye—1)/h) is independent of k, for all k = 1,2,...,N +1, so must 

equal a constant c, independent of k. Because this is true for all k, if F’(z) is 

continuous, yr — Yr—1 = constant and hence the points (xz, yz) lie on a straight 

line. 


Solution 2.2 


1 
(a) Since S[y + eg] = / dx (y’ + eg')*, we have 
0 


1 | 
<Sly +g] =2 | dx (y' + g')g’ and ASly.9] =2 | dx yg’. 
O O 


2 
(b) Since Sly + eg] = / dx x*(y’ + eg)”, we have 
1 


‘ . 2 
Cs tel=2f des%y +eq')g' and ASly.g)=2 f aea?y 
1 1 


m /2 
(c) Since S[y + eg] = / dx |(y’ + €g')* — (y+eg)*], we have 
0 


d nm /2 
<Sly + eg) =2 f del(y’ + e9/)g' - (y+ ea)a 
0 


m/2 
and ASly,g] = 2 / dx (yg — yg) - 
O 


b / 1\2 
(d) Since S[y + eg] = / dx wil we have 


a 


d b / / b 
«Sly +g) =2 f de Yt) 9 and Asty.g) =2 | je 
de a cad ‘ - 


b 
(e) Since Sly + €g] = / dx |(y’ + eg’)? + (y + eg)? + 2e*(y + €g)|, we have 


a 


r b 
zoe = 2 | dx [(y' + eg')g' + (y+ eg)g + e*g] 


b 
and AS|y,g] = 2 | dx [y’g' + (y+e*)g]. 


a 
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Solutions to Exercises in Chapter 2 


(f) Since Sly + eg] = I dx \/x? + (y+ €g)?\/1+ (y’ + €g’)?, we have 


d (y+ eg)g 
de | | 0 | of lg eg)" ( 


oie deg) (gy +49" jo" 
Ler a en) 


Solution 2.3 
(a) We have Si|[y + eg] = f. dx (y' + eg’), giving 


b 
AS; = / dx g’ = g(b) — g(a) = 0. 
(b) We have Sly + eg] = f. dx x(y’ + €g’)”, giving 
b 
BSo = 2 | dx xy’ gq’. 


(c) Since S3 = S; + Sg and Gateaux differentiation is a linear operation, 


AS3 = AS, + AS2 = AS2. 


Solution 2.4 
(a) Since OF /0y' = 2y' and OF /Oy = 1, the Euler-Lagrange equation is 
2y" —1=0. 
The general solution of this equation can be written in the form 
y= 4(¢-a) +(e sane. 
Putting x = a shows that @ = A, and then putting x = b gives 


B-A 


B-A=i(b—a)*+a(b—-a), so a= —_ 


ab 5(b = a), 
giving the solution 
B-A 
a= fur + (Fas -30-a)) ura 9 ea 
(b) If w= a —a, then the chain rule gives 
ar _ dy dx dy 


du dxdu dz 


and the functional becomes 
b—a 
s¥\= / dulY’*+Y], Y(0)=A, Y(b-a)=B. 
0 


The independent variable of the associated Euler-Lagrange equation is u, and 
a and b occur only in the boundary conditions and as the difference b — a. 
Hence the solution depends upon the three variables x, a and 6} only in the two 
combinations u = x — a and b—a. 
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Solution 2.5 
(a) If y is a constant and y(x) = y, equation (2.33) becomes G(y, 0) = —c. 


(b) The second-order Euler-Lagrange equation is 


OF? OF? , OF? OF 


By?” " ayayl” * dxdy’ dy 


If F(x,y,y’) = Gly,y’), the third term is zero, and if y = 7, this equation 
becomes G, (7,0) = 0, assuming that Gy (7,0) and Gy, (y,0) exist. 


Let g(y) = G(y,0) be a function of y. The equation G'y(7,0) = g(y) = 0 shows 
that y must be at a stationary point of g(y), whereas the equation Gy, 0) = —c, 
found in part (a), imposes the weaker restriction that c lies in the domain of 


G7, 0). 
Thus, in general the constant solution y = ¥ of the first integral is not a solution 
of the Euler-Lagrange equation. 

Solution 2.6 


Using the result of Exercise 1.16, we see that if G does not depend explicitly upon 2, 
OG/Ox = 0, and 


/ a OG _ OG _ SO 0°G 2 Oe 5 
dx \ Oy’ Oy) Oy’ ad Aydy’” By? 


But, using the chain rule, 


dx \" dy} ~~ dy \Y dy! Oy! \" dy! 


so the right-hand side of the previous equation becomes 


a SC) 9G 4 eye 
da \Y oy) dy” ~ Oy" ~ dz 8 dy)” dx da \ Oy | 


The left-hand side of this equation is zero if either y/(x) = 0 or y(a) satisfies the 
Euler-Lagrange equations. In the latter case, setting the right-hand side to zero 
and integrating gives 
,OG 
Y oo 
Y 


Oy Oy! Oy! Oy! 2 


G = c = constant. 


This is a first-order differential equation: its general solution will depend upon 
one other arbitrary constant, d, and to find the solution of the original problem 
we need to express the constants c,d in terms of the constants A,B defined in 
equation (2.33) (page 49). Often this is difficult, because it involves the solutions of 
nonlinear algebraic equations, and frequently there are real solutions only for some 
values of A and B. 


Solution 2.7 


In this case F = y’? — y’, giving OF /Oy’ = 2y’ and OF /Oy = —2y which leads to 
the Euler-Lagrange equation y” + y= 0. The general solution of this equation is 
y = Acosx + Bsinz, where A and B are arbitrary constants determined by the 
boundary conditions. The boundary condition at x = 0 gives A = 0, and that at 
x = X gives the solution 
sin x 
y(z) = sin X ’ 

provided that sinX £0. If sm X = 0, that is, X = nz, n= 1,2,..., there is no 
solution. 
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Solutions to Exercises in Chapter 2 


Solution 2.8 


(a) In this case F = y'?2 + y? + 22y, OF /Oy' = 2y' and OF /Oy = 2y + 22, giving 
the Euler-Lagrange equation y” — y = x (for this type of equation see Block 0, 
Subsection 1.3.2). The general solution of this second-order inhomogeneous 
equation is y = Acoshax+ Bsinhxz — xz, where A and B are arbitrary constants 
determined by the boundary conditions. ‘The boundary condition at x = 0 gives 
A= 0, and that at x = 1 gives the solution 


sinh x 
sinh 1 


y(z) = (1 +a) 
(b) Consider the difference 6S = Sly + eg] — Sly], where y is the above solution: 


1 
is=e | dx (g'* +9") >0 
0 
for all non-zero g(x). Hence the functional has a minimum along this solution. 


Solution 2.9 


The general equation of a straight line can be written as y = m(a—a)+c. The line 

passes through (a, A), so c= A, and through (b, B), so B = m(b—a)+ A. Hence 
B-—A 

is the required equation. 


Substituting this into equation (2.41), with u = x — a, gives 


b—a _ LC —— 2 
Sly| =2n | du y ih t+ shea 
0 b=a b= @ 


__ VOoe eee ans [(B — Aju + A(b — a)] 


ea 


= 7(B+ A) (b-—a)? +(B-A)?. 
Pythagoras’ theorem gives 1? = (b— a)? + (B — A)’, hence S = 7(B + ADI. 


Solution 2.10 


If f(a) = ccosh(a/c), f’(x) = sinh(x/c) and the functional (2.41) (page 52), with 
the appropriate change to the limits, becomes 


Sif = ane f dx cosh(a/e)\/1 + sinh? (a/c) 


n 
= Arc? ducosh?u, u=-, 1 
a) i SS 


= 2nc* (7 + sinh ncosh 7) , 


where we have used the relations cosh” vu = cosh 2u + 1 to evaluate the integral, and 
sinh 2u = 2sinh ucosh u to cast the result in this form. Dividing this by a?, we see 
that the dimensionless area $[f|/a? depends only upon 7: 

S\f| 2a 


— oe (7 + sinh cosh 7). 
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Solution 2.1 | 
(a) Using the expansion cosh 7 = 1 + $7* + O(n*), we obtain the small 7 expansion 
of 9(7) 


1 ee | 
g(n) = — coshn = — + =n + O(n’), 
n 2 ( 


so for small 7 the solution of the equation g(7) = A/a is n~a/A. But since 
n = a/c this gives c~ A and 


f(x) = Acosh(z#/A) ~ A since |r| <a < A. 


With f(z) = A the area is S = 4zAa. Alternatively, since 7 = a/A <1, so 
coshyn ~ 1 and sinhn ~ 7 the result derived in the previous exercise gives 
Slf1|/a? = 47/n, and hence S; = 47Aa. 


(b) The equation for 7 can be written as 
A 


1 
—- (e"+e"") 


e!l 


—2n 
- (l+e-°"). 


If 7 >> 1 the e~?” term is negligible by comparison to 1; for instance, if 7 = 3, 
e—2" = 0.0025, and if 7 = 5, e~7” = 0.00005. Hence the equation becomes 


A 1 

—=—e! +}. 

aa (7 > 1) 
For large 1, 


cosh 7 = Se" (1 + e~*") ~ se" and sinhn= se" (1 — a ~ se", 


SO 


a? 2n n 
Since 7 > 1, e2” > n, that is, e?"/n? >> 1/n, so the first term dominates. 


S 2 ae 
SU) 7 (n+ te") =20 (5) scl 


Solution 2.12 


(a) The derivative of g(n) is g'(n) =~‘ sinhn — 7-* cosh, which is zero when 
ntanhy = 1. The graphs of y = tanhy and y = 1/7, for 7 > 0, are shown in 
Figure 2.22: tanh” increases monotonically from zero to unity as 7 increases 
from 0 to infinity, and 1/7 decreases monotonically from infinity to zero over 
the same range of 7, hence there is one and only one positive real root of 
“Hianbh7 = 1, 


Figure 2.22 Graphs of y = tanhy and y = 1/7. 
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A numerical calculation shows that g/(7) = 0 at n= 7,, = 1.1997 and here 
g(a.) = 1.5089. 


(b) At the stationary point the area is, using the result obtained in Exercise 2.10 


1 1 
S = 2na? (— + — sinh n,,, cosh rm) 


Nm m 


ed — (1 + sinh? Tn) 


m 


> 2 
= weal tie. 


m 


since 1,,, sinh»,,, = cosh»,,. But, by definition, 


A 1 
— = 9(%m) = — coshy,, 
a "mn 

hence 


f= Dna" 9 9G.) = Bt Hy. 


Solution 2.13 


With c= A and f(x) = A, equation (2.46) and the boundary conditions are satis- 
fied. The Euler-Lagrange equation is, since G = y\/1+ y’?, 


[se eee. 
dx (we) - <—s 


With y = constant this reduces to 1 = 0; hence the function y(x) = A is not a 
solution. 


Solution 2.14 


(a) The functional does not depend explicitly upon xz, so we may use the first inte- 
eral of the Euler-Lagrange equation y/OF'/Oy’ — F = constant, where Fy, y’) = 


Jy(1+y’2). This gives /y = c\/1+y"", where c is a positive constant. Re- 


arranging this equation then gives the first-order differential equation 
dy : 
2 2 
Le | ae, —1)=y(1) = A. 
( z) y ASH) 


This equation is separable, so can be written in terms of two integrals as 


d 
c / a / ax, 
Te ce 
and integration gives 


2 
2cVy —C? =x+a or y=or4 Eta 
C 


for some constant a. The boundary conditions at x = +1 give 
iF —1)? 
erly 2 pa (a — 1) | 
Ac? 4c? 
Hence a = 0 and A=c?+1/4c?. This last equation is a quadratic in c? so 
gives 


G=diai(atVa— ie 


Hence, if A > 1 there are two solutions of the Euler-Lagrange equation, but 
none if A < 1. The two solutions are 


1 
4c. 


A=CH+ 


veia) = (4c4. +2 s 


Typical graphs of y.(x) are shown in Figure 2.23: note that for large values 
at A, o.(e) = A. 


S| 
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Figure 2.23 Graphs of y+(x) for A = 1.2 (left) and A = 8 (right). 


Substituting the ag solution (for any c) into the functional gives 


ee 
1 


=i da (4c* += *) 
. 

=2c+ — 63° (2.90) 
In order to determine which path gives the largest value of S{y], we consider 
the difference 


S{y-]— Sts] =2(c--e4) +8 (4 - a) 


2 2 
= fer “neinhi| = +e ee 
(ey sea) 
= $(c, —c_)((A-1)>0 if A>1, 


where we have used the relations c,c_ = j and ci. +c? = A, which follow 


directly from the original quadratic equation for ci. This relation shows that 


Sly_| > Sly+] for A > 1. 
If A=1, ce. =c_. = 1/V2 See = a Paes, OAS 1 wetiave 


2 _A i+ a _A iz ee ae ee 
- Ss A} 2 2A2 8 AA | 


where we have used the binomial expansion V1 — 2 = 1 — x — au? +--- 
Hence 


1 1 
2 
A= A(1- ga-ga et), 


and on taking the square root 


= VA(1- og (1+ ete \y"=va(i-go+--) 


4A? 4A? 
Similarly, 


1 1 1 1 
ss Ce ee ee ivi ee a ees. ee 
fe: a(t+zet ) giving Cc —a(itaet ) 


Putting c. = VA and c_ =1 / 2\/A, we obtain the approximations 
1 
a and y= 7 + Ax® ~Az?, A>1. 
Substituting these approximations for c into the integral (2.90) for S we obtain 
SlyzJ~2VA and Sly_]~ 4 A/? A> ti. 
For A close to 1, we find the value of S[y+] by setting A? = 1+ B’, where B 
is a small positive constant. This gives 
ee ae oan -) 


Pe ze 
= 1 BP +B), ae Pe 
= 3 (vit ee v3(5 ae er ee 
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which gives 
3 


4 B? B 
a lee Cm aw aa el 


This shows that, as expected, at A = 1 (B = 0), S[y_] = S|y,], but also that 
the two curves join tangentially at A = 1, as shown in Figure 2.24. 


1.5 y) 2.5 3 3.5 4 4 


oS 


Figure 2.24 Graphs of Sly+]. 


(c) The Goldschmidt curve is defined by 


0, 42) < 1, 


fa(x) = ' 7, 


so y'(x) is not defined at |x| = 1. Hence we define a function that approaches 
fa(x) as 6 — 0 for some parameter 6. We need consider only positive values 
Gr #: 
0, i <o<i-% On Fes. 
Y5\L) = A 
) A-~(1—2), l-—4< 2 <1. 


The graph of this function is shown in Figure 2.25. 


ys (x) 
A 


Figure 2.25 Graph of the function ys(z). 


The value of the functional on y5(x 


Siys|=2 [ sa a a 
_, 
fs 9 VU 
= —\/ A(A? + 6°) a 1—-, wherev=1-gz, 
} 0 ‘) 


= 4,/A(A2 +6°) = $A? as 5 > 0. 
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Solution 2.15 


For a curve defined parametrically by the functions x(@), y(@), the area under it 
and between 6 = 6; and @» is 


x(@2) O05 dx 

A= / dz yl) = / dé —y(6). 
x(61) 04 dé 

For the cycloid, x(@) = a(@ — sin@), y(@) = a(1 — cos@), therefore 


27 27 
A=a | dB (1 — cos)? =a? | dO (1 — 2cos6 + cos’ 6) = a?(2a + 1) = 3a”. 
0 0 


For the length of a curve, we use a variant of equation (1.7) (page 9). Suppose 
that @ increases from 0 to 6+ 60, then to O(6@), x and y increase by x’(@) 66 and 
y'(@) 60, respectively. Hence the length of the small element of the curve is, using 
Pythagoras’ theorem (see Figure 1.5, page 9), 


6s = \/x'(0)2 + y'(0)2 60 + O(6507), 
and the length of the curve between 6; and 62 is 


02 
= / dg /u'(d) + yO? 


i 


For the cycloid, x'(@) = a(1 — cos@), y’(@) = asin@ and the length of the arc OP is 


0 
s=a/ do \/ (1 — cos ¢)2 + sin? ¢ 
He 
=a dd \/2 — 2cos @, 
0 


i 
sie 2a f do sin(¢/2) 
= 4a (1 —cos(0/2)) = 8asin? (0/4), 


where we have used the identity cos z = 1 — 2sin?(z/2) twice. 


Solution 2.16 


The gradient is 
dy dy dx asin @ 1 


dx do’ do a(1 — cos 6) * tan(0/2)’ 


where we have used the identities sin2w = 2sinwcosw and cos2w = 1 — 2sin? w. 
The cycloid is perpendicular to the x-axis when the gradient is infinite, that is, 
when tan(@/2) = 0, or 6/2 = nz, n = 0,1,... 


Solution 2.17 


The Taylor series for sin# and cos @ are given in the Handbook; the first few terms 
are 


sind =0—20°+O0(0°), cos =1—40°40(6*). 
Hence 
x = a(0 —sin@) = 4a0° + O(6?) and y=a(1—cosé) = sa + O(6"). 


1/3 


The first equation gives 0 = (6x/a)*/’, and substituted into the equation for y this 


gives 


y = 5 (62/0). 
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Solution 2.18 


(a) The initial energy is EF = mgAX, and since x decreases during the fall, equa- 
tion (2.68) becomes 


0 z 
ee Ji se 


x V2gAX — 2gAzx 
_ ite? [ ee ie 
= 2gA 0 \/ X — 2 


1+ A? 
=2 WX. 
2gA 


(b) The initial point (X,Y), where Y = AX, satisfies the equation 
x+y - BF =e, 
which becomes (1 + A*)X = 2AR. Substituting this into the above equation 
for T’ gives the required, rather surprising, result. 
Solution 2.19 


It is convenient to write z(¢) in the form z = c* sin’ ¢, use the fact that z/(x) = 
z'(b)/x'(@), and express the integrand of the functional in terms of ¢: 


_ f” ,,d [1+ aoe 
r=} “ib Dgaleonn 9 


zr’ c(1—cos2¢)  tand’ 


SO 


Solution 2.20 


In this case F = y’* — y* — 2zy, so OF /Oy' = 2y’, OF /Oy = —2y — 2x, and the 
Euler-Lagrange equation is the linear second-order inhomogeneous equation (see 
Block 0, Subsection 1.3.2) y/’ + y +a =0. The general solution of this equation is 


y = Acosx+ Bsinzg — «z. 


The boundary condition at x = 0 gives A = 0, and at x = 1 we have 0 = Bsin1 — 1, 
giving the required solution. 


Solution 2.21 


(a) OF /Oy' = 2y' and OF /Oy = 2y, so the Euler-Lagrange equation is y”” — y = 0. 
The general solution of this equation can be written in the form 


y = acosh(x — a) + @sinh(z — a). 
Putting x = a shows that a = A, then putting x = 6 gives 


B — Acosh(b — a) 


B= Acosh(b—a)+Gsinh(b—a) so G= Baa Ss ela aad 


giving the solution 
B — Acosh(b — a) 
sinh(b — a) 


y = Acoshu+ sinhu, u=2-—a. 


(b) If u =a -—a, then the chain rule gives 
dY dy dx dy 


du dxdu dx’ 
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and the functional becomes 
b—a 
siy] =| du(V'? 4¥2), “Y(0) =A, Y@—a)=B. 
0 


The independent variable of the associated Euler-Lagrange equation is u, and 
a and b occur only in the boundary conditions and as the difference 6b — a. 
Hence the solution depends upon the three variables x, a and 6 only in the two 
combinations u = «x — a and b—a. 


Solution 2.22 


Using the given trial function, the functional becomes 
1/2 
S(yi) = / dx [4y4 — 4yFa? — 4y, 27] 
0 


1 
+ / dx |4y{ — 4y{(1 — x)* — 4y,2(1 —2)| 
1/2 

= [Yy? — dm] + [Buf - 3m] = Fut — 3y- 
This function is stationary at the root of S’(y,) = 22y,/3 — 1/2, that is, y; = 3/44 ~ 
0.0682. 


Solution 2.23 


(a) In this example F = y/* + 12ry and OF /O0y’ = 2y’, OF /Oy = 12x”. Hence the 
Euler-Lagrange equation is y’” = 6x, y(0) = 0, y(1) = 2, having the general 
solution y = x°? + Ax + B, which satisfies the condition at x = 0 if B = 0 and 
the condition at z = 1 if A = 1. Hence the stationary path is y = 2° + a. 


(b) Inthis case F = y’?/x? and OF /Oy’ = 2y'/x? giving the Euler-Lagrange equa- 


tion 
d y’ dy 2 
=\e)= ae lead 


for some constant a. The general solution is therefore y(x) = ax’ /3 + 3. The 
boundary conditions give 


A= a+ 8, B=dq. 50 a= iB and B=A- SB. 


Hence y(x) = B(x? — 1) /124+ A. 


(c) In this example F = 2y?y/? — (1+2)y? and 
OF F 
—— = dy*y’ — = Ayy’? — 21 
Byte tae Ge ae (l+x)y 


The Euler-Lagrange equation is 


d dy dy ; 
2— Ty — }— ay — 1 = 
. (u Z| v (Z) + (1 +ejg=e 


which simplifies to (yy’)’ + $(1 +x) = 0. Integrating this gives 


dy cha go 1 >, 

= = — — 1 ——— 

Ge ge) ee 

and integrating again, y(x)? = B+ Ax — z(1 + x)°. The boundary conditions 

then give y(0)? = B- %=1,soB= Zand y(1)? = f4+A—-8=4,s0A= 2 
Hence the solution is 


ee (1 pe =0 sayy 9e see (1 Pec weee 


The solution is written in this way because it is easier to understand. The 
cubic f = (1+ 2)(6+ 2)(4—2) is zero at x = —6, —1 and 4; f is positive for 
x < —6 and negative for x > 4. It follows that y is real only for x < 24, for 
some x; < —6, and possibly for some x in the interval —1 < x < 4, depending 
upon the magnitude of f in this interval. Numerical calculations, which you 
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are not expected to do, show that 7, ~ —6.33 and that y is real in the interval 
(—0.264, 3.59). 


(d) In this case F = y/y’? is independent of x, OF /Oy’ = —2y/y/° and we may 
use the first integral, equation (2.33), to give y/y’? =a, y'(0) = 1, y(a) = A’, 
where a is a constant. The boundary condition at « = 0 gives y(0) =a. At 


x =a, y(a) = A* > 0 and hence we assume, for the present, that a > 0 and put 


a =c’, with c>0. The differential equation becomes, on taking the positive 


square root, because y’(0) = 1>0 and c> 0, 
d - 2 
dy_ Vy dy _ 2 


dx C sae c 
Hence the solution that fits the boundary condition at r = 0 is /y=c+ax /' 2. 
The boundary condition at 2 = a then gives 2c? — 2Ac + a = 0 giving 


= 5 (A+ VA? — 2a). 


Both roots are positive and real provided A > 2a. In this case we have the 
two solutions 


2 
usa) = (co + =] and ca = 5 (At VA? = 2a). 


264. 


Since y/y’? = a = c’, the value of the functional on these solutions is S[y+] = 
ac, and so S[y4] > S[y_]. 


Finally, consider the possibility that y/y’? = —c?, c > 0, so y(0) = —c? < 
0. The general solution that satisfies the boundary condition at x = 0 is 
y = —(c+2/(2c))*, but this cannot be made to satisfy the boundary con- 
dition: at x = a. 


Solution 2.24 


The Euler-Lagrange equation is 


d (we) \_, 
de \ fi+y(@)?} 
which integrates to w(x)y'(xz) = A,\/1+ y/(x)?, where A is a constant. Rearranging 


this and integrating again gives the general solution 


vel = Be A 


I 
ui ————_———.. 
,/w(u)? — A? 
If w(x) = Jz, this becomes y(z) = B+ Af” du; —z and hence y(x) = C+ 
2AVax — A?, where C is a constant. 


If w(x) = x the general solution becomes y(x) = B+ A f° du ==> Nae EE giving y(x) = 
C + Acosh *(a/A). 


Solution 2.25 


For a function of N variables a stationary point is where 
N 
OG 
S" &,5— =0 for all &;,. (2.91) 
4 Ox, 


The fact that the sum is zero for all €,, means that OG/Ox;, = 0 for all k; this is 
the equivalent of the fundamental lemma of the calculus of variations. 
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Solution 2.26 


(a) 


(b) 


Since F = (y/? — ay and OF /dy’ = 4y’ (y’? — 1), the first integral of the 
Euler-Lagrange equation, equation (2.33), is (y’? — 1)(3y/? + 1) = constant. 
Hence y’? = m? for some constant m, which we assume positive. 


The solutions of the equation y'(x)* = m? that satisfy the boundary condition 
y(0) = 0 are y(x) = +maz, m > 0. Hence one solution that fits the boundary 
condition at « = 1 is y = y; = Az and on this path S[y,] = (A? — 1)?. 


Another solution has the form 
mi, Gieenf<4 
y(x) = 
C— MX, . = a0 < 1, 
where m, c and € are constants. The boundary condition at x = 1 givesc = A+ 


m. Since the solution needs to be continous at x = € we also have m& = c — m& 
and hence € = (A +m)/2m. 


Because m > 0 and € < 1 it follows that m > A; for m =A we regain the 
solution y = Az, but for m > A we obtain y2(x). 


Another solution is 
—mz, U<a2<t 
y(x) = 
c+me,. £4 28. 


The boundary condition at x = 1 gives c= A—m and the continuity condition 
gives —mé =c+mé, and hence € = —c/2m = (m — A)/2m. Since € > 0 this 
gives m > A, as before. This gives the solution y3(2). 


If A > 1, the minimum value of the functional is (A? — 1)? and this is given by 
the solution y = Az. 


If A < 1, we may choose m = 1, for y2, and m = —1 for y3 to give the minimum 
value of zero. 


Solution 2.27 


(a) 


This integral can be evaluated directly, S{y| = y(1) — y(0), and its value is 
independent of the path, regardless of the boundary values. 


Similarly S{y] = }(y()? — y(0)2). 


Since F = ryy’, OF /Oy' = xy, OF /Oy = xy’ and the Euler-Lagrange equation 
is y = 0, which does not satisfy the boundary conditions. 


Alternatively we have 


i e d 1 ea 1-7 - 
Slyj== | drx—y? = |= | > f deyt= 5-5 f dev’. 
w= 5 [ dee ty? =| seule?) —5 f aev=5-5 f aes 
The Euler-Lagrange equation for the functional on the right-hand side of this 
equation is again y = 0. 


Solution 2.28 


We expect the Euler-Lagrange equations for these two functionals to be identical 
because 


Soly] = Sily] + [G (2, y(2)) 12 


and the boundary term is independent of the path. Now we derive the result 
directly. 


Consider the Euler-Lagrange equation for S2[y]. First define 


.. / dG | / OG OG , 
F(2,y,y) = F(z,y,y)+ = Ple,y 9 + * ay! 
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so that 
rie call 6 Fa and Lien meen 
Oy Oy Oxdy Oy Oy. .Oy'.. Oy 
Hence the Euler-Lagrange equation for F’ is 
4 fae. 4 (ek () a, 
ae oF Oy dx \ Oy’ Oy dx \ Oy OxdOy Oy? 
But, 


4 (a) _ #6 ea, 
dx \ Oy) Oxdy By? 7 


so the Euler-Lagrange equations for F and F are identical, as expected. 


Solution 2.29 


Clearly S[y| > 0 and for the given solution the integrand is identically zero, so for 
this solution S = 0, its minimum value. The Euler-Lagrange equation is 


(y"” — 2) y? +2(y! — 2x) yy’ — (y’ — 2a)" y = 0, 


which is satisfied by the functions y(x) = 0 and y(x) = x”. Thus the given function 
satisfies the Euler-Lagrange equation except at x = 0 where y’(x) is not defined. 


Solution 2.30 


If y = (1, Y2,°°'; Yn) with yo = Aand yn41 = Band year = ye +A, then yp, occurs 
only in the k =n and k =n-+1 terms and 


OS O mn 9 Gn O n ~ ret 
OYn h 


=< 0 0 0 _ Yn — Yn-1 
= ha Fen) — Raf (en) yf en+1)s a= (2.0 Tae eer ve ) 


Now we need to express (%n+1 — Yn)/h in terms of (Yn — Yn—1)/h. First, write 


Ynt1 — Yn _ Yn ~ Yn-1 x Uo) oie A te 
h h h 


and use the Taylor expansion 


Yn+1 — 2Yn + Yn-1 = y(n 7 h) aaa 2yl 2x) > Yy (Ln ns h) 


= y(Ln) + hy (ex) * thy" (Ln) " ah? y’" (tn) ve O(h*) — 2y(2n) 


+ af dn) = hy' (tn) - Shy" (xn) = En? y'" (an) a O(h*) 
= h?y" (t,) + O(h*). 
Hence 


which gives 
—s OF  ,OF wae 9 
Flenti) = Flen) +h (FE tan ge tang) + OU?) 
It follows that the equation for 0S/Oy, becomes 
OS OF O {OF Pe; A 9 
Be, h 5 (Sn +5) + O(h*), 
OF OG. , 0G . 486 9 7 
F & Tig Ye =) POW a 


as [oF 4d (dF ‘ans 


h 


89 


90 Chapter 2 


Since OS/Oy, = 0 it follows that 


d (OF OF 
= (Ge) ~ 55 OM) 1 2. cy Fe 


and that as h — 0 we obtain the Euler-Lagrange equation. 


Solution 2.31 


In this more general case we use the approximations 


1 1 7 2 
/ de <(x @=ndow z(t) and / toe oP any (Ae =2G0)) ; 
0 0 


where z(x) is any function and the set of equally spaced points 7, = k/(N + 1) 
defined in the question. Hence the functional becomes 


N 
sa? Pia wes pee Dyan h=>—, 


$ ee — ye) uK) =" 2k 
a "Wai" 


(a) If N =1 there are two terms in the sum; the first is y?/h, since yo = 0, and 
the second is (1/h — h)y? — hy, and since h = 1/2 this gives 
S(yi) = Lut Syl. 


This function is stationary where 0$/0y, = 7y; — 1/2 = 0, that is y) = 1/14 = 
0.0714, compared to the exact value of y(1/2) = 0.0697. 


The difference between this approximation to S and that obtained in Exer- 
cise 2.22 is because the approximations to the functional are different. In both 
cases we approximate the solution by the same type of polygon. However, in 
the first case we evaluated the integrals exactly, whereas in the second case we 
made an additional approximation to evaluate the integrals. For the approxi- 
mation used in Exercise 2.22 we have 


1 1/2 1 
/ dx y' (x)? = 4y? | dx +f da) = 4yi, 
0 0 1/2 
1 1/2 1 1 
/ dx y(x)* = 4y? / dx x? +f de (1 — x) |. = =e, 
0 0 1/2 3 
1 1/2 1 1 
/ dz 2xy(x) = 4y, / dx x” +/ dxx(1—x)} = =y1. 
0 0 1/2 2 


For the approximation used here, these integrals are approximated by 


i 


1 
/ dx y' (x)? =2 (yeti — ye)? = 4y7 
0 


jah) 
; Z 1 2 | 1 2 : k 
| dey(2)’ = 5 > ve = si and |, dx 2ey(2) = ) 5 um = 5u- 


(b) If N = 2, then h = 1/3, y3 = 0 and 


1 2 1 A 
S=3y, + ~ —y)* - 3 (02 oe cn) + Ee —3 (02 ra sun) 


Soa 5 ag 6 Z 4 
3 > v1 3 Y1¥2 9 41 9 2: 
The stationary points are at the solutions of 
GOs: g4 2 OS 34 4 
A = aera Cee ee RS 


9 Oy2 a 
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which simplify to the given equations. These have the solutions y; = 35/624 ~ 
0.0561 and yo = 43/624 ~ 0.0689, which are the approximate values of the 
solution at x = 1/3 and 2/3 respectively. 


Solution 2.32 
(a) The Gateaux derivative, equation (2.14) (page 45), of this functional is 
d 
ASly, 9] = Sly + €g] 


b 
OF 
= I 
= =f dx g (57 


Integrating by parts twice gives 


wo Lane) 


OF 2. OF 
=i (x) Oy!’ ae sii [ a1 t) 73 Oy" : 


But g(x) and g’(x) are both zero at x =a and i so for the functional to be 
stationary we need 


@ (OF 
dx2 Oy!” 


for some constants c and d. 


OF 
=e Ome 2) + a, 


) =(. Integrating this twice gives 5 
Y 


(b) If F(z) = $2? the differential equation for y(x) is y’(x) = c(v — a) +d. Inte- 
erating this twice gives 
y'(x) = $e(x@ —a)* + d(ax—a)+a and 
y(x) = £ce(x — a)’ + 5d(a — a)* +a(x —a)+ 8. 
The boundary conditions at x = a give y'(a) = Ap = a and y(a) = A; = BG, so 
y(z) = ee(z — a)? + 4d(x — a)* + Ao(x —a) + At, 
and the constants c and d are determined from the boundary conditions at 
x = b. Setting D = b—a the two equations y(b) = By and y’(b) = Bz become, 
respectively, 


1-p%44dD?+A,.D+A,=B, and 5cD*+dD + Az = Bo, 
which simplify to the quoted equations. 
(c) Consider the general functional Sly =f. dx F(y"), so 
O° F 


; UT] OF i 2 : ' 2 
Sy tea =Sul+e f deg"@)55 + 5¢ f dea" (a 5a 


and on the stationary path 


Some. Cae E 
Sly + eg] — Sly! =5¢/ dx g (1) aya to 


Since g(x)? > 0 the sign of this integral depends upon 0°F'/0y" 4. Ty the 
present case, however, F(z) = 27/2, F(z) =1 and hence the integral is posi- 
tive and the stationary path is a minimum. 
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Solution 2.33 


First, note that if y(x) and y(xz) + eg(a) are both admissible functions then g(z) 
and its derivative, g'(x), are zero at x = a and b. The Gateaux derivative, AS|y, g] 
(see Equation (2.14)), is 


: d : d / / hf // 
lim, “Sly +9] = | dx —F(aytegy tegiy +e) 
Thus 
b 
OF OF OF 
AS = d — utes is 


Integration by parts gives 


[i OF A ¢ a (oF 
; Oy) ~«Pay|. J, eee 


Since g(a) = g(b) = 0 the boundary term vanishes. Similarly, 


b b b 
OF OF ad 7 OF 
i ee / = eae 
/ dx g Oy" ae 9 al / dx g dx (57): 


_ bah 28 far + fare OF 
“3 T By!" om Gy” J |. bs oe Oy" } 


Again the boundary terms vanish because g’(a) = g’(b) = 0 and g(a) = g(b) = 0. 
Hence 


: OF d (OP. # f OF 
asiwal= [arate | ~ ae (ay) + ae (apr) | 


Using the fundamental theorem of the calculus of variations we see that a necessary 
condition for the functional to be stationary on a function y(x) is that it satisfies 
the equation 


ae OF _ a OF oe 
dx? \ dy!’ dx \ Oy’ 


with the given boundary conditions. 


e=0 


Solution 2.34 


(a) If fF =1+y"(x)? the required derivatives are OF /Oy" = 2y"(x) and OF /dy’ = 
OF /Oy = 0, so the equation for the stationary function is d*y/dx* = 0. The 
general solution of this equation is the cubic y(x) = ax? + br? + cx + d, where 
the constants a, 6, c and d are determined by the boundary condition. Those 
at x = 0 give y(0) = d=0 and y’(0) =c = 1; those at x = 1 then give y(1) = 
a+b+1=1 and y’(1) = 3a+ 20+1=1, so that a=b=d=0, c=1 and 
the solution is y(x) = a. 

(b) In this case OF /Oy” = 2y”", OF /Oy' = 0, OF /Oy = —2y, so the equation for the 
stationary function is 

d*y / - (nr 

a4 © y(0) = 8p Fae OY Eye ey yak 
The general solution of this is y(x) = Acosx + Bsinx + Dcoshx+ Esinhz. 
The boundary conditions at + = 0 give 


y(0)=A+D=1 and y’(0)=B+E=0 
and those at x = 7/2 give 
y(3)=B+Dc+Es=0, y' (%)=-A+Ds+ Ec=-1, 


where c = cosh(7/2) and s = sinh(z/2). Using the first two equations to sub- 
stitute for D and EF in the second two gives 


(s—1)B+Ac=c and Bc+(s+1)A=s++l. 
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These equations have the solution A = 1 and B = 0, hence EF = D = 0, and 
the required solution is y(z) = cos. 


Solution 2.35 


(a) 


(b) 


In this case F = y/? — y* so OF /Oy' = 2y', OF /Oy = —2y giving the Euler— 
Lagrange equation y” + y = 0. 

Suppose that y(x) and y(x) + eg(x) are two admissible functions with y(«) 
satisfying the two given boundary conditions. Then if A = Sly + eg] — Sly], 


b 
A = 2 [hg(®)u(d) — bag(a)y(a)l + 2e / eet ep SOO. 


a 


Integrate by parts to put this in the form 
AS = 2€[9(b) (y'(b) + hoy(b)) — g(a) (y'(@) + hay(a))| (2.92) 


b 
-2€ f dx (y” + y)g + O(e*). 


But hay(a) + y'(a) = 0 and hpy(b) + y’(b) = 0, so 


b 
A= -2¢ | dx(y” + y)g + O(e?). 


The fundamental lemma of the calculus of variations shows that a necessary 

condition for S[y] to be stationary is that y(x) satisfies the differential equation 
// 

y +y= 0. 


Note that use of the fundamental lemma here needs some care. As stated 
earlier (see page 47), the function g(x) needs to be zero at x = a and b. Here, 
however, since both y(x) and y(x) + eg(x) are admissible functions we have 


hag(a)+g9/(a)=0 and hyg(b) +g'(b) =0 


so the lemma does not apply directly. But the same method can be used on 
the open interval a < x < b, so we may deduce that on this interval y” + y = 0, 
and use continuity to extend the interval to a <a <b. 


Solution 2.36 


(a) 


Since dz/dx = 1/(dx/dz) we have V1 + 2/2 = V1+ 2/2/z’ and hence, to within 
an irrelevant multiplicative constant, 


re) = fo de YEE = fa 
a az |z'|,/z 0 z 


since x’(z) < 0. 


Use the result given in Exercise 1.7 (page 14) and the fact that the term O(e) 
is, by definition, zero on the stationary path to cast the difference in the first 
required form. Now change the integration variable from z to @, using the result 
x'(z) = x'(¢)/z'(@) = tan @. The integral exists and is positive and hence the 
stationary path is a minimum. 


Solution 2.37 


(a) 


(b) 


The triangles ABC and AB,D are similar and AC is, by construction, twice 
AD. Hence AB is twice AB, and h = 2h}. 


At each iterations of this procedure the number of similar triangles doubles. 
After n iterations we therefore have 2” similar triangles: the heights are 2~”h 
and the length of the side is /2~”. But since there are 2 x 2” sides the total 
length is 2l, the original distance along ABC. 


Since / is arbitrary the distance along the curve may be made as long as 
required and by choosing n sufficiently large, we will never stray further than 
any prescribed small distance from the straight line AC. 
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CHAPTER 3 


Further developments of the 
theory 


3.1 Introduction 


This chapter contains three main topics which continue the development of 
the general theory of the calculus of variations in the direction needed for 
the reformulation of Newton’s equations as a variational principle. 


First we consider the effects of changing variables, both dependent and in- 
dependent variables, normally denoted by y and x respectively in previous 
chapters. This is important because one of the principal means of solving 
complicated differential equations is to find new variables that simplify the 
problem. Here we show that when a problem can be formulated as a varia- 
tional principle, the algebra involved in changing variables is made easier. 


Secondly, we describe how the theory is generalised to include many depen- 
dent variables. This generalisation is essential when we reformulate New- 
ton’s equations because separate variables are required for each degree of 
freedom; for instance, if a particle moves in a plane, two dependent vari- 
ables are necessary to describe its position, whereas two particles moving in 
a plane require four variables. This generalisation is relatively simple and 
involves no new ideas. 


Section 3.5, the final, most difficult (and optional) part of this chapter deals 
with invariance properties. ‘hese are important because, when present, 
drastic simplification of the Euler-Lagrange equations becomes possible. A 
precursor to part of this theory was introduced in the previous chapter where 
it was shown, in Exercise 2.6 (page 51), that under certain circumstances 
integrals of the motion exist, so the second-order Euler-Lagrange equation 
can be integrated directly to give a simpler first-order equation. This simpli- 
fication was used to help solve the minimum surface area problems and the 
brachistochrone, in Sections 2.5 and 2.7 respectively. In Section 3.5 we de- 
scribe a more general principle from which the first integral may be derived, 
and in Subsection 3.5.2 this principle is used to derive a natural generali- 
sation of the elementary result obtained in Exercise 2.6. Students knowing 
some dynamics will be aware of how important conservation of energy, lin- 
ear and angular momentum can be: the theory described in Subsection 3.5.2 
unifies all these conservation laws. ‘This theory is, however, difficult and is 
not assessed; we therefore suggest only a cursory reading if it is found to be 
too hard. 
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3.2 Invariance of the Euler—Lagrange 
equation 


In this section we consider the effect of changing both the dependent and 
independent variables, and show that in both cases the form of the Euler— 
Lagrange equation remains unchanged. Such a theory is important because 
one of the principal methods of solving a differential equation is to change 
variables with the aim of converting it to a standard recognisable form. For 
instance, the unfamiliar equation 


d’y dy y= Naat | 
becomes, on setting z = x!/* (x > 0), the familiar equation 
d?y 
oe =e. a 
ae (3.2) 


It is rarely easy to find suitable new variables, but if the equation can be 
derived from a variational principle the task is made easier because the 
algebra is usually simpler: you will see why in Exercise 3.1, where the above 
example is treated for a = 2; see also Exercise 3.24. 


Here we deal with the case of only one dependent variable, but important 
examples involving many dependent variables are given in the next chapter. 
Indeed the full power of this technique becomes apparent mainly in the 
advanced study of dynamical systems. 


When deriving the Euler-Lagrange equation we represented the path be- 
tween end points in a Cartesian coordinate system. This coordinate system 
is neither always the most convenient nor the most useful: for instance, 
rather than using the Cartesian coordinates (x, y), it may be better to rep- 
resent a path in terms of polar coordinates (r,#), where x = rcos@ and 
y =rsin@. Indeed, it is precisely this change of coordinates that prompted 
Euler’s concern with the problem of the invariance of his necessary condi- 
tions. In modern parlance, this translates to the form of the Euler-Lagrange 
equations being invariant; we shall see exactly what this means in the fol- 
lowing two sections. 


In the first section we consider the simpler types of transformation in which 
only the independent variable is changed. In the second section we consider 
Euler’s original problem in which paths are described in both Cartesian 
and polar coordinates, the aim being to understand the changes induced in 
the Euler-Lagrange equations. In both cases we shall derive the important 
result that the form of the Euler-Lagrange equation remains unchanged. 


3.2 Invariance of the Euler-Lagrange equation 


3.2.1 Changing the independent variable 


The easiest way of understanding why the form of the Euler-Lagrange equa- 
tion is invariant under a coordinate change is to consider the effect of chang- 
ing only the independent variable x. Thus for the functional 


b 
Sly] = / dz F(x, y(x),y'(x)), (3.3) 


we change to a new independent variable u, where x = g(u) for a known 
function g(w), assumed to be invertible for a < x < b. 


We note that a continuous function f(x) is invertible for a < x < bif there is 
an inverse function g(x) such that f(g(x)) = x. Often the notation f~!(zx) 
is used to denote the inverse g(x). For instance, if f(x) = \/z, for x > 0, 
then f*(e)= <°. 


With this change of variable y(x) becomes a function of u and it is convenient 
to define 


Y(u) = y(g(u)) = y(). (3.4) 
The chain rule gives 


dy dydu_ Y"(u) _ Y'(w) (3.5) 
dx dudx dz/du g'(u)’ . 


so the functional becomes 
d / 
Y"(u 
stv] = [du wr (9(w).¥w, oe). (3.6) 
¢ g'(u) 
where the integration limits, c and d, are defined implicitly by the equations 


a = g(c) and b = g(d). Note, in order to avoid a proliferation of symbols we 
use the same symbol, S, for the original and the transformed functionals. 


The integrand of the original functional depends upon 2, y(x) and y’(z). 
The integrand of the transformed functional depends upon u, Y(w) and 
Y’(u), so if we define 


Flu, ¥(u),¥'(u)) = 9'(u) F (aw), ¥(u), = | | (3.7) 


the functional becomes 
d 
sty] = / du Flu, ¥ (aoa): (3.8) 


Because this functional depends upon u, the new dependent variable, Y (w), 
and its first derivative the derivation of the associated Euler-Lagrange equa- 
tion is exactly the same as for the original functional. The Euler-Lagrange 
equation in the new variable, u, is therefore 


df ar OF 

ie (> ] “a nae 
in contrast to the original Euler-Lagrange equation 

d (OF OF 

— | — | —- — =0. 3.10 

da 4 Oy sa 


These two equations have the same form, in the sense that the formula (3.9) 
is obtained from (3.10) by replacing the explicit occurrence of x, y, y’ and F 
by u, Y, Y’ and F respectively. The new second-order differential equation 
for Y, obtained from (3.9) is, however, normally quite different from the 
equation for y derived from (3.10), because F and F' have different functional 
forms. 
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Consider, for instance, the functional 


2 ‘2 
Sly] = / de, y(1)=1, v2) =2, (3.11) 


which is similar to the example dealt with in Exercise 2.23(c) (page 70). The 
general solution for the associated Euler-Lagrange equation is derived in the 
solution for this exercise and is y(x) = 3+ ax°/3; the boundary conditions 
give G=6/7 and a = 3/7. 


Now make the transformation © = u%, for some constant a. The chain rule 
y] 
gives 


dy _dydu _ Y"(u) 


ae dude qqent? Where ¥(u) =y(u") = y(2), (3.12) 
and the functional becomes 
gi/a / . gi/a } 2 
11 (YW) 1 Y"(u) 
_ a—l = 
S[Y| —_ a | duu 24 (—) = =f du pe es . (3.13) 


Now choose 3a = 1, so the functional simplifies to 
8 
sty] =3/ x Vist RSs (3.14) 
1 


The Euler-Lagrange equation for this functional is Y”(u) = 0, having the 
general solution Y = C+ Du. The boundary conditions give C + D = 1 and 
C+ 8D = 2 and hence S[Y] is stationary when 


Y(u)=2(6+u) giving y(z) =Y(u(z)) = 7 (6+ gts. (8:15) 


In this example little was gained, because the Euler-Lagrange equation is 
equally easily solved in each representation. This is not always the case, as 
the next exercise shows. 


Exercise 3.1 


The functional 
x 
sul = fo dx (yay), X>0, 
0 


where w is a constant, gives rise to the Euler-Lagrange equation y’ + wy = 0. 


(a) Show that changing the independent variable to u where x = u? gives the 
functional 


SY = af du —- — uu?) , where Y(u) = y(u?) 


and U = VX, with the associated Euler-Lagrange equation 


PY do 2 


Show that this is the same as equation (3.1) when a = 2 and w= 1. 
(b) Show that if z = u?, then 
d7y 1 ( ey x) 


eee! (SS u—_ 
dx? te & ade* = dus 
and hence derive the above Euler-Lagrange equation directly. 
Note that the method described in part (a) for transforming the differential 
equation requires only that we compute dy/dx and avoids the need to calculate 


the more difficult second derivative, d?y/dx?, required by the method used in 
part (b). 


3.2 Invariance of the Euler-Lagrange equation 


Exercise 3.2 


A simpler type of transformation involves a change of the dependent variable. Con- 
sider the functional 


b 
Slyl = / hel? 


(a) Show that the Euler-Lagrange equation for this functional is y” (a) = 0. 


(b) Define a new dependent variable z related to y by the differentiable function 
y = G(z) and show that the functional becomes 


b 
Big| = / dx G’ (z)*z'*. 
Show also that the Euler-Lagrange equation for this functional is 
[G'(z) 2” + G"(z) 2'*] G'(z) =0, 


and that this is equivalent to the original Euler-Lagrange equation provided 
G'(z) #0. Note that the condition G’(z) 4 0 means that the equation y = 
G(z) may be inverted to give z as a function of y. 


3.2.2 Changing both the dependent and 
independent variables 


In the previous section, particularly in Exercise 3.1, it was seen that when 
changing the independent variable the algebra is simpler if the transforma- 
tion is made to the functional rather than to the associated Euler-Lagrange 
equation. The reason for this is simply that changing the functional involves 
only first derivatives, whereas transforming the equation directly involves 
second derivatives. 


For the same reason it is far easier to apply more general transformations 
to the functional than to the Euler-Lagrange equation. The most general 
transformation we need to consider will be between the Cartesian coordi- 
nates (x,y) and two new variables (u,v): such transformations are defined 
by two equations 


r= f(u,v), y= 9(u,r), (3.16) 


taking each point (u,v) to a unique point (x,y), and vice versa. Rather 
than deal with this general case, however, it is easier to consider a particular 
example that highlights all relevant aspects of the analysis. Thus we consider 
the transformation between the Cartesian coordinates (x,y) and the plane 
polar coordinates (r,@) where 


e=cecost, »g=rsind, r > 0, —7 <0 <a. fa-27) 
The inverse transformation is given by 
x 
r?=az*+y*, cos?=-, sind= m (3.18) 
r , 


In Cartesian coordinates we normally choose x to be the independent vari- 
able, so points on the curve joining (a, A) to (b, B), depicted in Figure 3.1, 
are given by the Cartesian coordinates (x, y(Z)). 
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Figure 3.1 Diagram showing the relation between the Cartesian and polar 
representations of a curve joining (a, A) and (0, B). 


Alternatively, we can define each point on the curve by expressing r as a 
function of 6, and then the curve is defined by the polar coordinates (r(6), @). 
If the end points of the curve have the Cartesian coordinates (a, A) and 
(b, B), then equation (3.18) gives the polar coordinates of these points as 


A 
To = Var + A’, cos Oy = —, sin Oa = —, 
‘ : (3.19) 


b B 
r, = V0? + B?, cosh, =—, sin& = —. 
Tb lp 
The aim is to transform a functional 
b 
su = [dx F(e.y(2),9'(e)), ula) = A, u(0) = B, (3.20) 


to an integral over @ in which x, y(x) and y’(x) are replaced by expressions 
involving 0, r(@) and r’(@). First we change to the new independent variable 
0: then since x = rcos@ and y = rsin@, we have 


% dx 
S= / do a (r cos, rsin 6, y'(x)) . (3.21) 
Ba 


The derivative dxz/d0@ is obtained from the relation x = rcos@ using the 
product rule and remembering that r depends upon 0: 


dx dr , 
70 79° 6—rsin0. (3.22) 


It remains only to express y'(x) in terms of r, 0 and r’(@). First, note that 
dy dyd@_ dy dx 


Now use the product rule again to give 
CY GF < 
a 7p ne +rcosé. | (3.24) 
Hence 
dy a a ae (3.25) 
dx  r'(0)cos@—rsiné 
and the functional becomes 
Op 
Siri = / d0 F(@,r(@),7'(0)), “r(@.) =a, (05) = ro, (3.26) 
6a 


where 
‘(@) sin 6 0 
F = (r'(0) cos0 — rsin 0) F (“ cos 6,r sin 6, Ted ne) heart) 
and where (rq, 9,) and (rp, 4) are the polar coordinates of the end points of 
the curve. 


3.2 Invariance of the Euler-Lagrange equation 


The new functional depends only upon 6, r(@) and the first derivative r’(@), 
so the Euler-Lagrange equation is 


d (OF OF 

a(ge)- m7 8 a.28) 
which is the transformed version of 

ae for OF 

Rate (paid ree aie es, | .. 

da a) rs (3.29) 


This analysis shows that the transformation to polar coordinates keeps the 
form of the Euler-Lagrange equation invariant because the transformation 
of the functional introduces only first-order derivatives, via equations (3.22) 
and (3.25), so does not alter the derivation of the Euler-Lagrange equa- 
tion. The same transformation applied to equation (3.29) involves finding a 
suitable expression for the second derivative, d*y/dx*, which is harder. 


As an example of the reverse transformation, consider the functional 


Oy 
Bir} = / dO \/r? + r'(0)?, (3.30) 
Ba 
already expressed in polar coordinates. Here, 
FPR Swear, (3.31) 
SO 
OF/Or=r/Vr2+r'2 and OF/Or =1'/Vr2+r!2. (Sede) 


The independent variable is 0, so the Euler-Lagrange equation is 


a ( pen | ees SoS (8.38) 
Bi fea) Jeu 

Expanding this gives 

r (rr +r’) r 


A a, 3.34 
Vier? (+r? pe Vere men 


which, on multiplying through by (7 +r’ 2)3/ ai simplifies to 


2 d 2 
rn = (SF) —r2=0. (3.35) 


In order to transform the functional to Cartesian coordinates we need the 
derivative d0/dx, which can be obtained by differentiating tan @ = y/x with 
respect to 2: 


Ld yiz).¥ 
dO _ _¥ 3.36 
cos? 6 dx r all ie) 
giving 
dO sxy'-y 
ae oe ag 


since 1/cos* @ = 1+ tan? 6 = (x7 + y*)\/z?. .Also, r? = 2? + y?, and differ- 
entiation with respect to x gives 


dé 
et aay (3.38) 


and hence 
dr (yy +2)r 


eo 3.39 
dé cy’ —y ( ) 
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Thus the functional becomes 


> dé 
= ay '(Q)2 
Sly| / dx av" + r'(@) 


b / ! 2 
ae L 
=| —— n+ (ae Pr? 
eal xy! —y 


b / 
1 vy —y 2 2 
= dx —————.4/ (xy' — y)° + (yy’ + 2)’. (3.40) 
| reg aa 
Now assume that ry’ — y 4 0, a condition we shall discuss a little later, so 
/ — 

ea eae 11, | (3.41) 

Izy’ — y| 


and remember that the Euler-Lagrange equation is unaffected if the func- 
tional is multiplied by a constant factor. Also, we have 


(xy’ — y)* + (yy! +2)° a (x? + y?) aft (x? + y?) 
= r2 (1 +y'*) . (3.42) 


Hence, the functional is equivalent to 


b 
Sty] = / dn Jt oe (3.43) 


This is the functional treated in Subsection 1.2.2, where it was shown 
(page 10) that the Euler-Lagrange equation reduces to y/(x) =constant, 
with general solution y = mx +c. On this solution the condition ry’ — y 4 0 
becomes c # 0, which means that the path must not pass through the origin. 
The reason why this condition has occurred here is simply because the trans- 
formation to polar coordinates, x = rcos@, y = rsin@, is not well behaved 
at the origin which is represented by r = 0, for all 6, so here the transforma- 
tion is not invertible. It is worth noting that the equation y’(x) = 0, when 
expressed in polar coordinates, is just equation (3.35). 


This example illustrates the simplification that can occur when suitable 
transformations are made: the art is to find such transformations and fre- 
quently one is guided by geometric insight, but we do not expect you to 
master this art here. 


Exercise 3.3 


(a) Show that, in polar coordinates, the functional 


b Oo 
sul = | dx v(x? + y?\/1+y'2 becomes sri= | ddrvr2 + r'2, 


6a 
and that the resulting Euler-Lagrange equation is 


dr 3 (dr\* 
a ee 
dé xr \ dé 
Note: you will need to assume that r’(@) cos @ — r(6) sin@ 4 0, which is equiv- 
alent to assuming that |y’(x)| is bounded. 
(b) Show that this equation may be written in the form 
a $1 gas 
d6? \ r2 ro” 
and hence that the equations for the stationary paths are 


1 
— = Acos26+Bsin20, thatis A(x? — y*)+2Bay = 1, 
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where A and B are constants. The following identities are useful: 


cos 26 = cos?@—sin?@ and sin20 = 2sin@cos@. 


Exercise 3.4 


A simple transformation is obtained by interchanging the roles of the dependent 
and independent variables. For instance, consider the elementary functional 


b 
sul = | dry), yla)=A, 9b) =B, 


which depends only upon y’, and was shown in Chapter 1 to be stationary along 
the straight line joining the points (a, A) and (0, B). 


Using the fact that dy/dx = show that if y becomes the independent vari- 


1 
dx/dy’ 
able, the functional becomes 


B 
S[z] = / dy G(2’), 2(A) =a, y(B) =), 


3.3 Functionals containing many 
dependent variables 


In Chapters 1 and 2 we considered functionals of the type 


b 
Styl = / dx F(a,y,y'), ya) = A, yb) =B, (3.44) 


which involve one independent variable, x, a single dependent variable, y(x), 
and its first derivative. There are many useful and important extensions to 
this type of functional and here we discuss the generalisation needed for the 
re-formulation of Newtonian dynamics described in the next chapter: others 
will not be considered but, in order to provide some idea of other possible 
developments, we list a few of the more important examples. 


(a) The integrand of the functional (3.44) depends upon the independent 
variable x and a single dependent variable y(x), which is determined 
by the requirement that S[y] be stationary. A simple generalisation 
is to integrands that depend upon several dependent variables yz (x), 
k= 1,2,...,n, and their first derivatives. This type of functional is 
required in the next chapter and the necessary theory is described here. 

(b) The integrand of (3.44) depends upon y(z) and its first derivative. 
Another generalisation involves functionals depending upon second or 
higher derivatives. Some examples of this type are treated in Exer- 
cises 2.32, 2.33 and 3.33, though we do not consider this type of problem 
any further. 

(c) The integral defining the functional may be over a surface rather than 
along a line, 


Oy O 
w= ff dx; dro F (x10 on x). (3.45) 


where D is a region in the (x1, x2)-plane, so the functional depends upon 
two independent variables, 7; and x2, rather than just one. Boundary 
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conditions are, of course, also needed to specify a unique stationary 
function. In this case the Euler-Lagrange equation becomes a partial 
differential equation. Many of the standard equations of mathematical 
physics can be derived from such functionals — the wave equation being 
one important example and this is considered briefly in the last, optional, 
section of Chapter 4 (page 190). There is also a natural extension to 
integrals over higher-dimensional spaces, but we do not consider these 
types of problems. 


(d) Broken extremals: a broken extremal is a continuous solution of the 
Euler-Lagrange equations with a first derivative which is discontinuous 
at a finite number of points. That such solutions are important is clear 
from the observation of soap bubbles: when two or more bubbles form 
a composite shape this usually comprises spherical segments such that 
across any common boundary the normal to the surface changes direction 
discontinuously. A simple example of such a solution is the Goldschmidt 
curve defined by equation (2.59) (page 57); see also Exercise 2.29. 


3.3.1 Two dependent variables 


In this Subsection we find the necessary conditions for a functional depend- 
ing on two functions to be stationary; however, because we are ultimately 
interested in functionals depending upon any finite number of variables, we 
shall often use the notation for which this further generalisation becomes al- 
most automatic. Most examples of functionals with two or more dependent 
variables arise in the study of dynamics and some will be introduced in the 
next chapter — here we consider only the general theory. 


We shall find necessary conditions for a functional depending upon two func- 
tions, yi(x) and yo(x), and a single independent variable x to be stationary. 
The functional is 


b 
Sly, ye! =| dx F (x, y1, 42,94; Y2) ; (3.46) 


where each function satisfies the boundary conditions 


yz(a)= Ap, Yr(d) = By, k= 1,2. (3.47) 


We require functions y;(x) and yo(z) that make this functional stationary 
and proceed in the same manner as before. Let y (x) and yo(x) be two 
admissible functions — that is, functions having continuous first derivatives 
and satisfying the boundary conditions — and consider the difference 


6 = Sly + gi, yo + €g2] — Sly, ya], lel <1, (3.48) 


where y,(x) + € g(x), k = 1,2 are also admissible functions; as in Chapter 2 
(page 45) this means that gz(a) = g,(b) = 0, k = 1,2. The analysis starts 
by finding the first term in the expansion of 6 as a power series in €, and for 
this we need first to expand the integrand, 


F(z,y1 + €91, y2 + €92, Y} + €91,¥5 + €99) — F (x, y1, Ye, y1, Yo) 
OF ,@F @F  ,OF 


=€ (m5 + ag + a + oer | +O(e*), (3.49) 


where all functions on the right-hand side are evaluated at « = 0. Hence, on 
writing 6 in the form 6 = « AS + O(e?), we find that 


b 
OF OF OF OF 
as= fa (> + —g, + — + 57a) 3.50 
: ay t By yy? * Bye” (3.50) 


3.3 Functionals containing many dependent variables 


with the integrand evaluated at « = 0. Notice that AS is the appropri- 
ate generalisation of the Gateaux differential defined in equation (2.14) 
(page 45), that is, 


AS = 


d 
J Slyi + €g1, yo + €g2| (3-04 | 


de e=0 
The expression (3.50) for AS is simplified when integrating by parts those 


terms involving gj(x) and g5(x): thus, for the first such term, 


oe an d (OF 
Lag |oag],- f oea (oe) sie 


with a similar expression for the term involving g5(x). Since gz,(a) = gz(b) = 
0, k =1, 2, the expression for AS becomes 


b d fpr OF 
as=- | oe = (5) 1 


‘ d (Or\ oF 
fo lae (aa) an 


For a stationary path we need, by definition (Chapter 2, page 45), AS = 0 
for all gi(x) and go(x). On setting go(xz) = 0 the above equation becomes 
the same as equation (2.30) (page 48) with y and g replaced by y; and gj 
respectively. Hence we may use the fundamental lemma of the calculus of 
variations, Section 2.3, to obtain the second-order differential equation 


d (OF OF 
Rises ae ll = Ae. b) = B.. es! 
te (Sr) dp 7 mle) = AL, m0) = Bi (3.54) 
This equation looks the same as equation (2.31) (page 48), but remember 
that here F' also depends upon the unknown function yo(2). 


Similarly, by setting g;(2) = 0, we obtain another second-order equation 


a for OF 
= (x) ee 0, yo(a) = Ao, yo(b) = Bo. (3.55) 
Equations (3.54) and (3.55) are the Euler-Lagrange equations for the func- 
tional. These two equations will normally involve both y;(x) and y2(x), so 
are named coupled differential equations; this usually makes them far harder 
to solve than the Euler-Lagrange equations of Chapter 2, which contained 
only one dependent variable. 


An example should help to make this clear. Consider the functional 


m/2 
Styi.uel= [de (vi? + uf? + 20) (3.56) 
0 
so that 
OF OF 
—— = 2y! d — = 2yo. 3.00 
Oy), Y, an Oy, Y2 ( ) 
Equation (3.54) becomes 
dy 
ae = fia =) 3.08 
a Y2 ) ( ) 
which involves both y;(x) and y2(x). Also, 
OF OF 
— = 2y) d —=2 3.09 
gee ag (3.59) 
and equation (3.55) becomes 
d*yo 
ne gy 0 (3.60) 


dx 
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which also involves both y;(x) and yo(«). 


Equations (3.58) and (3.60) now have to be solved. Coupled differential 
equations are normally very difficult to solve and their solutions can behave 
in bizarre ways, including chaotically; but these equations are linear which 
makes the task of solving them much easier. Also, solutions of linear equa- 
tions are generally better behaved. One method is to use the first equation 
to write y2 = y;, so the second equation becomes the fourth-order linear 
equation 


OSS, dag dag (3.61) 


This equation was treated in Block 0, example 1.28, so here we provide a 
mere outline of the method of solution. Suppose that the solution has the 
form y; = aexp(Ax), where a and X are constants; substituting this into 
the equation gives \* = 1, showing that there are four solutions obtained 
by setting A = +1, +7: the general solution is a linear combination of these 
functions, 


yi(x) = Acosx+ Bsinx +Ccoshz + Dsinhaz, (3.62) 
where A, B, C and D are constants. Since y2 = y//, we also have 
yo(x) = —Acosx — Bsinax + Ccoshz+ Dsinhz. (3.63) 


This general solution needs to satisfy the boundary conditions of the prob- 
lem: if a solution exists then the values of these constants are determined 
from the boundary conditions, as demonstrated in Exercise 3.5, below. It 
should be noted, however, that a solution may not exist, as shown in Exer- 
cise 3.6. 


Exercise 3.5 


Find the values of the constants A, B, C and D if the functional (3.56) has the 
following boundary conditions 


yi(0)=0, m(F)=1, ye(0)=0, yw (FZ) =-1. 
Exercise 3.6 


Consider the functional defined in equation (3.56), but with x in the range 0 < 
x <7. Show that if the boundary conditions are y;(0) = 0, y1(7) = a, yo(0) = 0, 
yo(7) = GB, then no solution exists unless a = 3, and then there are infinitely many 
solutions. 


Exercise 3.7 


Show that the Euler-Lagrange equations for the functional 


1 
Sly, ye] = / dx (yh° + y2" + y1y2) 
with the boundary conditions 
yi(0) = 0, yi(1)=1, yo(0)=1, yo(1) =2, 
integrates to 
2y, typ =a and 2y,4+y) = az, 


where a; and az are constants. Deduce that the stationary path is given by the 
equations 


yi(z)=a2 and yw(r)=a2+1. 


3.3 Functionals containing many dependent variables 


Exercise 3.8 


By defining a new variable z, = y; + y2/2, show that the functional defined in the 
previous exercise becomes 


1 
S{Z1, y2| = / dx (24° + 3 y),*) ) 
0 
with boundary conditions 
z1(0) = 7 z1(1) alts y2(0) = 2, y2(1) = 2, 
and show that the corresponding Euler-Lagrange equations are 


dz, d*y2 
7 = 0. and 72 


Solve these equations to derive the solution obtained in the previous exercise. 


= 0. 


Note that by using the variables (z1, y2) in the above exercise each of the 
new Euler-Lagrange equations depends upon only one of the dependent 
variables and is therefore far easier to solve. Such systems of equations 
are said to be uncoupled, and one of the main methods of solving coupled 
Euler-Lagrange equations is to find a transformation that converts them 
to uncoupled equations. In real problems, finding such transformations is 
difficult and often relies upon understanding the symmetries of the problem; 
then the methods described in Section 3.2 and Subsection 3.5.2 can be useful. 


Exercise 3.9 


Evaluate the expression defined in equation (3.51) to derive equation (3.50). 


3.3.2 Many dependent variables 


The extension of the above analysis to functionals involving any number, n, 
of dependent variables, their first derivatives and a single independent vari- 
able is straightforward. It is helpful, however, to introduce the notation 
y(x) = (y1 (x), yo(X),---,Yn(x)) to denote the set of n dependent variables; 
generally, we use bold characters to denote sets of variables. ‘There is still 
only one independent variable, so the functional is 


b 
Sly| “} dx F(z,y,y), y(a)=A, yo) =B, (3.64) 
where 


yw = (1(2), 9(2),---.Yalz)), 
A= (Aj, Ao,..-,An) and B = (By, Bo,..., Bn). (3.65) 


If y(x) and y(a) + € g(x) are admissible functions, so that g(a) = g(b) = 0, 
we again consider the difference 


5 = Sly + eg] — S[y] = « AS[y, g] + O(€*). (3.66) 


The Gateaux derivative, AS, is given by the relation 


d 


€=0 


b 
d 
=f dx aoe ey +eg,y + eg’) (3.67) 
a e=0 
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and for y to be a stationary path, this must be zero for all suitable g(z). 
Using the chain rule we have 


d is OF ,OF 

—F(rz,y+eg,y +g’ = (5 +g -), 3.68 

<F( Nog =e (ag, tay (3.68) 
hence 


AS = We dx (15 + de) (3.69) 


Now integrate by parts to cast this in the form 


i>) at a Ar ts 


i=] 


But, since g(a) = g(b) = 0, the boundary term vanishes. Further, since 
AS = 0 for all suitable g(x), by the same reasoning used when n = 2, we 
obtain the set of n coupled equations 


d (OF OF 
dx (ar) ; yx (@) ki Yaa ( ) ky 75 » ( ) 


This set of n coupled equations is usually nonlinear and difficult to solve. 
There is, however, one circumstance when the solution is relatively simple; 
this is when the integrand of the functional S[y] is a quadratic form in both y 
and y’, as in equation (3.72) in the following exercise, which is an important 
example because it describes small oscillations about an equilibrium position 
of an n-dimensional dynamical system. 


Exercise 3.10 This exercise is optional 
and requires knowledge of 
(a) If A and B are real, symmetric, positive definite, n x n matrices so Aj; = Aj; matrices. 


and B;; = B,; for 1 < 7,7 <n, consider the functional It can be shown that a real 
symmetric matrix has real 

dx Axiy’, Yy (3.72) eigenvalues: a positive 

=f 3 wi JAG Bi; i)» definite matrix is any 


matrix having only positive 
with the integrand being quadratic in y and y’. Show that the n Euler— eigenvalues. 


Lagrange equations are the set of coupled linear equations 


1 


d*y; 
>, Anja + Buys ==-{), 1< fe. 


j=1 


(b) el that if we interpret y as an n-dimensional column vector and its transpose 
y| as arow vector, the functional can be written in the equivalent matrix form 


Sly] -[ dx (y'' Ay’ —y' By), 


a 


and the Euler-Lagrange equations can be written in the matrix form 


d? 
A an — + By =0 
and that, on multiplication by A~!, this can also be written in the form 
d*y 
— + A 'By =0. (3.73) 


Whilst not part of this course, we now very briefly outline the theory which 
shows how the solutions of this equation can be found and how these solu- 
tions behave. The matrices A and B have, by definition, only positive eigen- 
bani, it can therefore be shown that A~!B has non-negative eigenvalues 
Toe k =1,2,...,n and that there are a set of n orthogonal eigenvectors zz, 


3.4 Changing dependent variables 


k =1,2,...,n, possibly complex, which diagonalise A~'B. Then if we express 
y as a linear combination of the Zz, 


= > enle) ee, (3.74) 
k=1 
it can be shown that 
d?a; 
ej + wa, | an ce oo as (3.73) 


These equations for a;(x) are trivially solved and the constants of integration 
determined by the initial value of y. Even without solving these equations, it 
is seen that in general the vector y comprises a sum of n periodic components 
with the frequencies w;, 7 = 1,2,...,n. We emphasise that these results are 
not an assessed part of the course. 


3.4 Changing dependent variables 


In this section we consider the effect of changing the dependent variables. A 
simple example of such a transformation was dealt with in Exercise 3.8, 
where it was shown how a linear transformation uncoupled the Euler— 
Lagrange equations. In general the aim of changing variables is to simplify 
the Euler-Lagrange equations, but it is usually easier to make the transfor- 
mation to the functional rather than to the Euler-Lagrange equations. The 
main use of this method is in Newtonian dynamics. 


Before explaining the general theory we deal with a specific example, which 
highlights all salient points. The functional is 


b 
Sly, y2] = / dx [3 (wi? +e) -Vir)], r= Vy? +93, (3.76) 


where V(r) is any suitable function. This functional occurs frequently be- 
cause, as will be seen in the next chapter, it arises when describing the 
motion of a particle moving in a force depending only on the distance from 
a fixed point. It is special because it depends only upon the combinations 
yi? + yi” and yf + y3, which suggests that changing to polar coordinates 
may lead to simplification. 


First, however, we need to be clear about the difference between the present 
example and that of Section 3.2.2, where polar coordinates were also used. 
Now we have two dependent variables, y; and yo, representing the Cartesian 
coordinates of the particle’s position, as shown in Figure 3.2; we shall use 
polar coordinates, r and 0, to replace these coordinates, but the independent 
variable, z, is unchanged. In Section 3.2.2 we had one dependent variable, 
y, and the polar coordinates were used to change both independent and 
dependent variables. 
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Y2 


Y1 


Figure 3.2. Diagram showing the relation between the Cartesian and polar 
coordinates (y1, y2) and (r,@), respectively 


The relations between (y1, y2) and (r,@) are yy = rcos@ and yo = rsin@, so 
that y? + ys — r* and, on using the product rule, 


d d dé d d dé 

-— = cos 8 as = sin@ and —- = oe meg. (3.77) 
Squaring and adding these equations gives 

yf? 4 ys — p24 prgf2 (3.78) 


Hence the functional becomes 


Siro] = r dx |5r'* + $r°0'* —V(r)]. (3.79) 


Exercise 3.1] 


(a) Show that the Euler-Lagrange equations for the functional (3.76) are 
d” yi 
er 


Y1 d? yo 17) #2 
V'(r)=— =0 d —>4V'\(r)— =0. 3.80 
+V(n4=0 and S240 (r4 (3.80) 
(b) Show that the Euler-Lagrange equations for the functional (3.79) can be writ- 
ten in the form 
rr 4 dé iL 
dx rr?’ 
where L is a constant. Note that the equation for r does not depend upon @ 
which is obtained from r(x) by a single integration, so these equations are easier 


to solve. This exercise shows how useful changing the dependent variables can 
be, provided the correct choice is made. 


(3.81) 


Exercise 3.12 


This is an algebraically complicated problem which aims to show directly that it is 
easier to transform the functional rather than directly attack the Euler-Lagrange 
equations. You may omit this exercise, but it is instructive and will help you 
understand why the theory of the next chapter is so useful. 


(a) By finding expressions for y// and y¥ in terms of the derivatives of r and @, Hint: multiply the first 


show that equations (3.80) become equation by cos 6, the 
2 xi » peepee: ; second by sin @, add the 
(r" — r6’*) cos — (r0” + 2r'6’) sin 8 + V'(r) cos@ = 0, results and use the identity 
(r” — r6'*) sin + (r6" + 2r’6’) cos + V'(r) sin @ = 0. eos’ 0+ sin’ @ = 1. The 
second equation is obtained 
(b) By rearranging these equations, show that by using a similar trick. 


| d 
"_ rf21V'"ir) =0 d —(r76’) =0. 
r r (r) an =, (r ) 0 


3.4 Changing dependent variables 


The general theory is not much more complicated that these examples. Sup- 


pose that y = (y1, y2,---,Yn) and z = (z1, 22,...,%n,) are two sets of depen- 
dent variables related by the equations 
te = Opel oe), 2. (3.82) 


where we assume, in order to slightly simplify the analysis, that each of the 
wy, is not saceiiad dependent upon x. The chain rule gives 


ae dx’ 


showing that each of the y, depends linearly upon the z/. It is necessary 
to assume that the transformation between the derivatives is invertible and 
it may be shown that the condition for this is that the determinant of the 
matrix with elements Oy,,/0z; is non-zero at all points of the region, which 
is also the condition for the transformation between y and z to be invertible. 


Under this transformation, the functional 


b b 
Stul = | dx F(x,y,y’) becomes ste = | hans (elpesz es BBS) 


where 


Gls, 22 )= 


F (set ), Po(z) nla), Do oe as Hs): (3.85) 


i=] 


that is, G(a,z,z’) is obtained from F(x,y,y’) simply by replacing y and 
y’. In practice, of course, the transformation (3.82) is chosen to ensure that 
G(a, 2,2’) is simpler than F(z, y, y’). 


Exercise 3.13 
Show that under the transformation to cylindrical polar coordinates, 


Yi =pcosg?, yo=psingd, y3=z2Z, 


the functional 


b 
Stu. ve.uel = | dx [5 (yi +yo° +3") -—V(o)], p= fue +93, 


becomes 


b 
S[o,d,2]= f de [3 (p'? + p70"? + 2!) -V(p)]. 


Find the Euler-Lagrange equations and show that those for p and z are uncoupled. 
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3.5 Symmetries (Optional) 


In previous examples, for instance the minimal surface area of Section 2.5, 
and the brachistochrone of Section 2.7, we solved Euler’s equation by using 
the fact that if the integrand, F'(y,y’) does not depend explicitly upon z, 
that is, OF /Ox = 0 then 


This result is derived in Exercise 2.6 (page 51) and is important because it 
reduces the second-order Euler equation to the simpler first-order equation 

5a — F = constant. [a6 
The proof of this result given in the solution to Exercise 2.6 is algebraic and 
relied only on the fact that OF’ /Ox = 0. In the following optional subsection 
we re-derive this result using the equivalent but more fundamental notion 
that the integrand F(y,y’) is invariant under translations in x: this is a 
more fruitful method because it is more readily generalised to other types of 
transformations; for instance in three-dimensional problems the integrand of 
the functional may be invariant under all rotations, or just rotations about 
a given axis. The general theory is described in Section 3.5.2. 


The algebra of the following analysis is fairly complicated and requires care- 
ful thought at each stage: this material cannot be read and understood 
quickly and for this reason, this section is optional. However, we encourage 
you to attempt to understand it because this theory is important for the 
understanding of advanced dynamics and also, it is a very elegant piece of 
mathematics. This material also illustrates the important role that sym- 
metries play in simplifying the mathematical description of nature, as was 
noted in Block I, Chapter 5, in the discussion of vibrating membranes. 


3.5.1 Invariance under translations 


We develop the general method by applying it to the example treated in 
Exercise 2.6 (page 51), in which there is a single dependent variable, x and 
where the integrand does not depend explicitly upon x, that is OF /Ox = 0, 
so can be written as 


b 
Sly] = / dx F(y,y), yla) =A, y(b) = B. (3.88) 


The admissible functions, y(xz), describe curves C’, between the points (a, A) 
and (b, B), in the two-dimensional space with axes Oxy, so that a point, P, 
on the curve has coordinates (x, y(x)), as shown in Figure 3.3. 


Y 


L 


O 


O 


Figure 3.3. Diagram showing the two coordinate systems Oxy and O%Y, connected 
by a translation along the x-axis by a distance 0. 


3.5 Symmetries (Optional) 


Another coordinate system could be OF y, where = x — 6 and Y = y, with 
the origin, O, of this system at x = 6, y = 0 in the original coordinate system, 
that is a translation a distance 6 along the x-axis. In this coordinate system 
the curve C’ has the equation 7 = y(Z), so the coordinates of a point P are 
(x, y(x)) and these are related to coordinates in Oxy, (x, y(x)), by 


Z=x2-6 and 7(£)=y(r) or F(z) = y(F + 0), (3.89) 


the latter equation defining the function y; differentiation, using the chain 
rule, gives 
dy dyaxn dy 


5 5: peemn0g 
meee 7 ee ee (3.90) 


The functional (3.88) can be computed in either coordinate system and, for 
reasons that will soon become apparent, we consider the integral, T'[y | in 
the Oy representation over a limited, but arbitrary, range ¢ < Z < d, 
Pipl = [ duP@u).7(w)) where ou) = 2, 
a du 
¢=c—6,d=d—dSanda<c<d<pb. The integrand of T depends on u 


only through the function y(u): at each value of u the integrand has the 
same value as the integrand of S[y] at the equivalent point, x = u+ 6. 


(3.91) 


Now follows the crucial step: because F’ does not depend explicitly upon 
the independent variable it follows that 


d d 
/ du F(y(u), y (u)) =| dx F(y(x),y'(z)), where rx =u+6, (3.92) 


Cc 


and this is true for all 0. 


Now consider small values of 6 and expand to O(6), first writing the integral 
in the form 


d—6o 
= / du F (G(u), 9 (u)) 


= 
d c d 
= du F(y, 7) + | du F(¥,7 ) -| du F(y,7'). (3.93) 
e c—é d—6 
This expression can be expanded to first-order in 6 by using the results 
/ du g(u) = g(z) 6 + O(6"), (3.94) 
z—0d 
and 
y(u) = y(u+d) = y(u) + y'(u) d+ O(6*), (3.95) 
and also 
7 (u) = y'(u) + y"(u) d+ O(8). (3.96) 


Now consider the three terms of equation (3.93) separately. To first-order, 
the first term is 


d d 
= / du F (9,7) = / du F(y+y'd,y' + y"5) + O(6). (3.97) 


(if 
Using the Taylor expansion 
/ / I" OF / OF iT] 2 
Fiy+ydo,y +y = Pe ge 5+ 5G y 6+ O(6*), (3.98) 
where all derivatives are evaluated with 6 = 0, this becomes 


: OF 0 
T, =| du Py iar | ey i" Oe). (3.99) 
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The second and third terms of equation (3.93) have the same form, 


[du FU.7) = FO2),T(2)) 6+ 00) 


= F(y(z),y/(z)) d+ O}), (3.100) 
hence 
¢ d q 
[ wrag)-[ dura) =-s[rowi+o@). (201 
c—é d—6d 
On combining these relations the expression (3.93) for T’ becomes 
: OF OF 
a / ele oe oe 
r= | du Pus) +54 +O 7Y | 


— 6[F(y,y)]¢ + O(8). (3.102) 


Because of equation (3.92) this gives 


d 
OF OF d 
=5] du (y—+y"— | -6/F(yy’ O(5°). 3.103 
0-8 [du (v5 tu" Ge) ~5[FUw +08) (3.103) 
Now integrate the second integral by parts, 
¢ OF 22 air i d (OF 
(— = }y’/—_ | — du y’ — | — )}. 3.104 
I - Oy E al | OY du ian sisi 


Substituting this into (3.103) gives 
OF : d (O8F\ OF 
=d\y——-F| -—6] duy' |—(—})-—!/+0(6). (3.105 
; E y | I ts raya lt eS 
But y(u) is a solution of the Euler-Lagrange equation, so the remaining 
integrand is identically zero, and hence 


OF OF 
oe = | F-y/— ) 
G : a) pame ( ‘eg pg, 


Finally, we observe that c and d are arbitrary and hence, for any x in the 
interval a < x < b, the function 


(3.106) 


F(y,y') — De EAC y’) = constant. (3.107) 


Because the function on the left-hand side is continuous the equality is true 
in the interval a < x < b. This relation is always true if the integrand of the 
functional does not depend explicitly upon 2, that is, OF /Ox = 0. It relates 
y (x) to y(a) and by rearranging it we obtain one (or more if, for instance, a 
square root is involved) first-order equation for the unknown function y(z), 
which is generally easier to deal with than the second-order Euler-Lagrange 
equation. Of course, by differentiating equation (3.107) with respect to x 
the Euler-Lagrange equation is regained, as shown in Exercise 2.6 (page 51). 


The function F — y’/0F/Oy’ is named a first integral of the Euler-Lagrange 
equation, this name being suggestive of it being derived by integrating the 
original second-order equation once to give a first-order equation. For the 
same reason in dynamics, quantities that are conserved, for instance energy, 
linear and angular momentum, are also named first integrals, integrals of 
the motion or constants of the motion, and we shall see in the next chapter 
that these dynamical quantities have exactly the same mathematical origin 
as the first integral defined in equation (3.107). 


3.5 Symmetries (Optional) 


As an example of a functional that is not invariant under translations of the 
independent variable, consider 


b 
Jig] = / dxxy'(x)*, y(a)= A, y(b) = B. (3.108) 


It is instructive to go through the above proof to see where and how it breaks 
down. In this case equation (3.92) becomes 


d d 
Jly| = / duuy’(u)? = / dx(x—6)y'(x)*, z=uto 


d 
a / dex y'(«)? # Sly, (3.109) 


This proof of equation (3.107) may seem a lot more elaborate than that 
given in Exercise 2.6 (page 51). However, there are circumstances when the 
algebra required to use the former method is too unwieldy to be useful, and 
then the present method is far more transparent. An example of such a 
problem is given in Exercise 3.33. The following exercise is long and fairly 
difficult, but it revises all the steps of the preceding analysis. 


Exercise 3.14 


In this exercise we consider a functional, 


b 
Sly] = / dz F(x,y,y') (3.110) 


that remains unchanged when the dependent variable is changed to % where x = 
(1 + 6)%, for some constant 6. This represents a change of scale in x, rather than 
a translation, as considered in the text. 


This means that for any c and d, where a <c <d< b, equation (3.92) is replaced 
by 


d d 
-> | du F(u,y(u), 9 (u)) = / dx F(x, y(x),y"(x)), (3.111) 
where 7/(Z) and y(z) are related by the identity 9(Z) = y(x), with « = (1+ 0)@. 
(a) Show that 


<4@) = +a that is '(%) = (1+ 4)y"(2). 


(b) Show that, to first-order in 6, 


_ 5 du F(u,7,y’) + O(6"), (3.112) 
= 
and that 
U(u) = y(u) + y!(u)ud + O(6*) 
and 


7 (u) =y/(u) + 6(y'(u) +uy"(u)) + O18). 


(c) By expanding the right-hand side of equation (3.112), show that, to first-order, 
this equation becomes 


d 
F F 
o= | = Geet ol + uy") : 


rv d 
Oy Oy! — pal fa th oh Ne 
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(d) Use integration by parts to show that 


[ve 0 OF _ ages] id (OF 
- ve * By! oe By! ‘ : ay “By! 


and hence that, if y(u) is a stationary path of S, the relation derived in part (c) 
becomes 


d 
= c (5a -F) for all c and d. 


U=C 
Deduce that for a functional satisfying the given invariance, the first integral 
is 


OF 
aF — xy’— =constant for all a <x < b. 


Oy! 


(e) If F = x%y'", for some constants a and 3 ¥ 1, show that the first integral de- 
rived in part (d) reduces to y’(a%) = Ax~7 for y = (1+ a)/@ and some constant 
A. Show that the solutions of this equation are 
Alnz+ B, i: 
y(x) = 


a 474+ B, yA 1. 

= 
Explain why only the first of these solutions, y = 1, is a stationary path of a 
functional invariant under a scale transformation. 


Later, when we discuss the variational formulation of Newtonian dynamics, 
it will be seen that the equivalent of equation (3.107) becomes the conserva- 
tion of energy in those circumstances when the forces are conservative and 
are independent of the time; that is, in Newtonian mechanics energy conser- 
vation is a consequence of the invariance of the equations of motion under 
translations in time. Similarly, we shall see that invariance under transla- 
tions in space gives rise to conservation of linear momentum, and invariance 
under rotations in space give rise to conservation of angular momentum. 


3.5.2 Noether’s theorem 


In this section we extend the previous analysis to deal with functionals 
having several dependent variables. The analysis is a straightforward gener- 
alisation of that presented above but takes time to absorb, so we recommend 
that you do not try to understand it properly until finishing Chapter 4, by 
which time the significance of Noether’s theorem will be clearer. For this first 
reading, try to understand the fundamental ideas and try to avoid getting 
lost in algebraic details. That is, you should try to understand the definition 
of an invariant functional, the meaning of Noether’s theorem, rather than 
the proof, and should be able to do Exercises 3.16-3.18. 


There are two ingredients to Noether’s theorem: 


(i) functionals that are invariant under transformations of both dependent 
and independent variables; 


(ii) families of transformations that depend upon one or more real parame- 
ters. 


We consider both of these elements in turn in relation to the functional 


b 
stu) = | ALT 2, 9,.8 yy ies te), (3.113) 


which has stationary paths defined by the solutions of the Euler-Lagrange 
equations. Notice that we have not mentioned the boundary conditions: 
this is because they play no role in the general theorem. 


3.5 Symmetries (Optional) 


The value of the functional usually depends upon the path taken, which in 
this section is not always restricted to being stationary. We shall consider 
the change in the value of the functional when the path is changed according 
to a given transformation: in particular, we are interested in those transfor- 
mations which change the path but not the value of the functional. 


Consider, for instance, the two functionals 


A path y can be defined by the pair of functions (f(x), g(x)), 0< 2 <1, 
and on each 7 the functionals have a value. 


Consider the transformation 


Yj =yicosa—yeosina, Y1 = Y,cosa+ Yosina, 

a with inverse = ag * 145) 

Yo = yisma+ y2 cosa, y2 = —Y, sina + Yo cosa, 
which can be interpreted as an anti-clockwise rotation in the (yj, y2)-plane 
through an angle a, independent of x. Hence under this transformation the 
curve ¥ is rotated bodily to the curve 7, as shown in Figure 3.4. 


Rotated curve 


Original curve (f(x), 9(x)) 


Figure 3.4 Diagram showing the rotation of the curve y anti-clockwise through an 
angle a to the curve 7%. 


The points on 7 are parametrised by (f(x), g(x)), 0 <x <1, where 
f=fcosa—gsina and g=fsina+gcosa. (a. bie.) 
Hence on the path y, the functional 5; has the value 
1 
Six) = fae [f'@)? + 9'(e)| (3.117) 
0 
and on the path 7 it has the value 
1 1 
8i(9) = fae [Fe +9@'| = f ar [@P+9'@"]. B18) 
0 0 


Hence the functional has the same value on y and on 9 for all a. That is, 
S;[y] is invariant with respect to the rotation defined in Equations (3.115). 


On the other hand, the values of S2 on 7 are 


i 
$9(+) = / dx [f'(x)? + 9/(a)?] f(a), (3.119) 


0 
and on 7¥: 


z / dx [f'(x)? + g'(x)?] [f(«) cosa — g(x) sin a] 


1 
= $o(7) cosa — sina | dx | f’(x)* + g'(x)?| g(a). (3.120) 


See Exercise 3.15. 
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In this case the functional has different values on y and on ¥, unless a is an 
integer multiple of 27. That is, Soly] is not invariant with respect to the 
rotation (3.115). 


Exercise 3.15 


Derive equations (3.118) and (3.120). 


The transformation (3.115) does not involve changes to the independent 
variable x, but the transformation considered in the previous subsection in- 
volved only a change in the independent variable, via a translation along 
the x-axis; see Figure 3.3. In general, it is necessary to deal with a transfor- 
mation in both dependent and independent variables, which can be written 
as 


t= O(c,y,9) ond 3 = Usage), F=—1,22...% (3.121) 


We assume that these relations can be inverted to give x and y in terms of & 
and y. For any curve y, defined by a specific function y = f(x),a<a2<b, 
this transformation moves y to another curve 7 defined by the equation 


¥ = f(z). 


Definition: The functional (3.113) is said to be invariant under the trans- 
formation (3.121) if 


G=G, (3.122) 


where 


d > d 
= d 
G= | Gf f | te, dy and G= / dx F\ x,y, as (3.125) 
“ dx . dx 


for all c and d satisfying a < c < d < b, with @ and d given by equa- 
tion (3.121) by putting x equal to c and d, respectively. 


The meaning of the equality G = G is easiest to understand if ¥ = x. Then 
the functions y(x) and y(a) define two curves, y and ¥ in an n-dimensional 
space, each parametrised by the independent variable x. The functional G is 
the integral of F(x, y, y’) along y and G is the integral of the same function 
along 7. 


By contrast, in the case x ~ @ the only difference is that the parametrisation 
along y and 7 is changed. In practice, the usual change to the independent 
variable, x, is the uniform shift 7 = «+6, where 6 is independent of x, y 
and y’; this is the example dealt with in the previous subsection. 


A one-parameter family of transformations is the set of transformations 
c= O99 4) aad 9, S877 ee), ©£—1,2,....% tole 


depending upon the parameter 6, which reduces to the identity when 6 = 0, 
that is, 


o=@(2,9,9 0) and yo = Vice), £=1,2,....n,, 95) 


and where ® and all the YW; have continuous first derivatives in all variables, 
including 0. This last condition ensures that the transformation is invertible 
in the neighbourhood of 6 = 0. An example of a one-parameter transfor- 
mation is defined by equations (3.115), which becomes the identity when 
a=). 
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Exercise 3.16 

Which of the following is a one-parameter family of transformations? 
(a) F=x-yd, Y=ytczo. 

(b) ©=acoshéd—ysinhéd, Y= —ycoshd+ ysinho. 


(c) =a and y = yexp(Ad), where A is a square non-singular n x n matrix, and 
the exponential of a matrix is defined by the infinite series 


ea 
exp(Ad) = ) =1+4+ Ad + 4$A7S? + EARP H--, (3.126) 
k=0 


Families of transformations are very common and are often generated by 
solutions of differential equations, as illustrated by the following exercise. 


Exercise 3.17 


(a) Show that the solution of the equation 


d 
= =y(l1—y), O<y(0) <1, SB 
is 
= U(z,t) = a, ee (0) (3.128) 
an 1+ (eb -1)2’ ee 


(b) Show that this defines a one-parameter family of transformations, y = V(z,t), 
with parameter tf. 


Exercise 3.18 


Show that the functional 


b 
Sly] = / dx (y\* — yo") (3.129) 
is invariant under the transformation 
Y¥, =yicoshd+yesinhé, Y.=yisinhd+yecoshd, Z=2+6 (2), (3.130) 


only if g(x) is a constant. 


We have finally arrived at the main part of this section, the statement and 
proof of Noether’s theorem. ‘The theorem is far easier to understand than 
its proof, so after reading the theorem we suggest that you do Exercises 3.20 
to 3.22, leaving the actual proof until you have time to understand it. The 
theorem was published in 1918 by Emmy Noether (1882-1935), a German 
mathematician, considered to be one of the most creative abstract alge- 
braists of modern times. The theorem was derived for certain variational 
principles, and has important applications to physics, especially relativity 
and quantum mechanics, besides systematising many of the known results 
of classical dynamics. 


The theorem deals with arbitrarily small changes in the coordinates, so in 
equation (3.124) we assume |6| < 1 and write the transformation as 


O®P 
zt=2+¢(z,y,y’)6 + O0(5), o= “AS ’ 
00 |5_6 
(3.131) 
Ow 
Vk = + uy, (2, y, y’)d + O(5°), Wy = — ’ 
00 |5_o 
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where we have used the 6 = 0 limit defined after equation (3.124). In all 
subsequent analysis, second-order terms in the parameter, here 6, are ig- 
nored. 


Exercise 3.19 
Show that, to first-order in a, the rotation defined by equations (3.115) becomes 


YW=yMt+yYa, YY =—4ye, 
Yo =Yotyoa, Yo=Y%1- 


Noether’s theorem 


If the functional 


d 
stul = | dx F(x, y,y') (3: To) 


is invariant under the family of transformations (3.131), for arbitrary 
c and d, then 


oe a OF 
—w+{F- _—— = constant. 3.133 


along each stationary path of S|y]. 


The function defined on the left-hand side of equation 3.133 is often named 
a first integral of the Euler-Lagrange equations. Equation (3.133) is the 
significant result and is the required generalisation of the simpler result first 
derived in Exercise 2.6 (page 51) and used in Section 2.5 to find the minimum 
surface of revolution, and in Section 2.7 for the brachistochrone problem. 


In the one-dimensional case, n = 1, and when y = y (WwW =0) and d= 1, 
equation (3.133) reduces to the result derived in the previous section, equa- 
tion (3.107) (page 114). The proof of Noether’s theorem is given after the 
following exercises. 


Exercise 3.20 


Use the fact that the functional 
b 
Stul = f de (yi? +44?) 


is invariant under the rotation defined by equations (3.115), and the result derived 
in Exercise 3.19, to show that a first integral is 


Y1 Y> = YoY} = constant. 


3.5 Symmetries (Optional) 


Exercise 3.21 
Show that the functional 
b 
Sty] =| da (yi* + 97) 


is invariant under each of the following three transformations only if g(x) is a 
constant. 


(a) Yi=ytg(x)6, Yo= ye, E=2 
(b) Y=, Go = wo + ole)d,. F==. 
(c) W=M, Yo = Ya; Sat eit )d 


In the case g(x) = 1, show that these three transformations lead to the following 
first integrals, respectively. 


(a) y; =constant (b) y=constant (c) yj}? +y%? =constant 


Exercise 3.22 


Show that the functional 
b 
Sly] = / dx [5 (wy" + 93°) + Vm — y2)] 
a 
where V(z) is a differentiable function, is invariant under the transformation 
Yi=YMtog(z), Yo=yotdg(z), T=x 
only if g(x) is a constant, and that in this circumstance a first integral is 


y; + Y5 = constant. 


Exercise 3.23 


Show that for the scale transformation « = (1+ 6)% considered in Exercise 3.14, 
@ = —«, and that in this case the first integral given in equation (3.133) is the same 
as that derived in Exercise 3.14. 


Proof of Noether’s theorem 


Noether’s theorem is proved by substituting the transformation (3.131) into 
the functional (3.132) and expanding to first-order in 6. The algebra is 
messy, SO we proceed in two stages. 


First, we assume that % = 2, that is, ¢ = 0, which simplifies the algebra. It 
is easiest to start with the transformed functional 


—_ {* _ dy oo 
=) dF (oe, = (since Z = 2). (3.134) 
: £ 


Now substitute for y and y’ and expand to first-order in 6, to obtain 


d 
T= {| dc F (xy + be,y/ +558) 


[  i-egae OF dw 
= F . ) ed io oe © 
/ dx (x,y, y’) =f | dx 2 (= Wp + wae (3.135) 


But the first term is merely the untransformed functional which equals the 
transformed functional — because it is invariant under the transformation. 
Also, using integration by parts 


d OF dy, i) gel anal ta d (OF 
fe sgae = [Mog], ~ fae (az) _ 
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and hence, by substituting this result into equation (3.135) we obtain 


eS folk safe SoS -£(R)}- um 


The term in curly brackets is, by virtue of the Euler-Lagrange equations, 
zero on a Stationary path. It follows that 


ee “ae 


(3.138) 
k=1 OVE lo=d k=1 OUR 


“L=C 


and since c and d are arbitrary, we obtain equation (3.133), with ¢ = 0. 


In the second stage of this analysis we consider the general case, 6 ¥ 0, 
which proceeds similarly but is algebraically more complicated. As before 
we start with the transformed functional, which is now 


2 = 
T= / dz F (z Y; =| : (3.139) 
7 dx 


where d= d+6¢(d), €=c+60(c), with o(c) denoting d(c, y(c), y/(c)) and 
similarly for ¢(d). Now we have to change the integration variable and 
limits, besides expanding F’. First consider the derivative dy/dzZ; using 
equations (3.131) and the chain rule, 


dy dy dy 
—<— = | aeeak ee ee ee t4 
dt (2 eS dx Fe dz’ — dx : i | caw 


and so, to first-order in 0, we have 


dy _ (# y +53) (1 - 6) - 7 +5(S - ea). (3.141) 


dx dx dx dx a2 dd 
Thus the integral becomes 
d = 
-_ dx dy dw dyddo 
i= —F dw, — — — —— }}. 142 
/ ar n+ 50.44 ve +o(F al —— 


Now expand to first-order in 6 and use the fact that the functional is invari- 
ant. After some algebra we find that 


OF dy,\ dé OF 
o=6 [ oa (2 d dyf e) e Ox 
OF OF du, 
+ (a5 + Pu +a) (3.143) 


Now integrate those terms containing the total derivatives of ¢ and y, by 
parts to cast this equation into the form 


— OF dy, es 
(P-L age) oD ay 


+6 f° dx D4 | - 4 = (5) . (3.144) 


Notice that if ¢ = 0 this is the same as equation (3.137). Finally, we need to 
show that on stationary paths the integrals are zero. The second integral is 


3.6 Further Exercises 


clearly zero, by virtue of the Euler-Lagrange equations. On expanding the 
integrand of the first integral the term in curly brackets becomes 


OF OF OF 
Elle “th-+ 5) ee Yi 


On 
é. _d (OF 


Using the Euler-Lagrange equations to modify the last term, it is seen that 
this expression is zero. Hence, because c and d are arbitrary, we have shown 
that the function 


OF — OF 
(ry 7 0 rh o+ do Year (3.146) 


where y(z) is evaluated along a stationary path, is independent of x, and is 
a constant. 


3.6 Further Exercises 


Exercise 3.24 


Using the functional 


Sly] =| daz (y'* — w*y*) 


and the change of variable z = x 
is transformed into 
Joy 
ae 


L/ © show that the differential equation y”” + w7y = 0 


d 
+(1—-c) = er a soil o—2 | 3 


Exercise 3.25 


Show that the Euler-Lagrange equations for the functional 


b 
Sly1, ya] = / dx F(y;,y), 


which depends only upon the first derivatives of y; and yo, are 


25 2 2R 
mM + agg =" Byatt + Bet =o 
Deduce that, provided the determinant 
O°F Or 
; Oy,” Oy, Oy’ 
O°F O°F 


Oy Oy, — Oys? 
is non-zero, the stationary paths are the straight lines 
yi(x) = Arvt+B, y(x)=Cax+D, 
where A, B, C and D are constants. Describe the solution if d = 0. 


What is the equivalent condition to d £ 0 if there is only one dependent variable? 


123 


124 Chapter 3 Further developments of the theory 


Exercise 3.26 


If (x, yi, y2) is any twice-differentiable function, show that the functionals 


b 
Siln.ve] =f da F (2, 91, 92,915 Vo) 


and 
b 
Sealy, y2| =| dx [F (©, Yi, 9251, Yo) Ph; Yo, Vises) 
where 
Ob O® O® 
WU ae ee ae ee / 


lead to the same Euler—Lagrange equation. 


Note that this is the direct generalisation of the result derived in Exercise 2.28 
(page 71). 


Exercise 3.27 


Consider the two functionals 


b 
Sily1, y2] = i da [5 (i> + yo") + gi(x)y, + 92(x)ys — V (2, 91, yo) 


and 


b 
Selyi, y2| = / dx E as + ys”) — V(x, y1,y2)| 
where 


V=V4+ai(z)yi + 95(x)ye. 


Use the result proved in the previous exercise to show that they have identical 
Euler-Lagrange equations. 


Exercise 3.28 


(a) Show that the Euler-Lagrange equation of the functional 


sil= [dee Va=eV 
0 


Te ee 
— (3e~* — 1) nee (3.147) 
(b) Show that the change of variables u = e~*, with inverse x = — In u, transforms 


this functional to 
1 
or |= : du,\/Y(u)+ Y’(u), Y(u) =y(—Inu), 
0 


and that the Euler-Lagrange equation for this functional is 


ay dY 
ate or =e (3.148) 


(c) By making the substitution « = —Inu, show directly that equation (3.148) 
transforms into equation (3.147). 


3.6 Further Exercises 


Exercise 3.29 


Show that the stationary paths of the functional 


with the boundary conditions y(0) = 0, z(0) = 0, y(7/2) = 3/2 and z(/2) = 1/2, 


mw /2 
Sly, z] =| dx (y!* + 2? + 2yz), 
0 


satisfy the equations 


d*y az 
ge 


dx? ae eee 


Show that the solution of these equations is 


sinh x sinh x 


ia ee oe Ue 
yz) ee aa sinh(1/2) 2 


Exercise 3.30 


Show that if y = G(z), where G(z) is differentiable, the functional 


b 
Sly| = / dx F(«,y,y') 


transforms to 


b 
Suis / dz F(x, Glz), G2), 


with associated Euler-Lagrange equation 


eS ce (2) OF OF ; OF 
dx ‘ Oy’ 


Exercise 3.3] 


Consider the functional 


stl = f de®, y(t)=A, y2)=B 


where A and B are positive and different. 


(a) 


1 
f. hat = ——_, 
Using the fact that dy/dx eee 
variable the functional becomes 
B 
1 
Sl]= f dyso, 2(A)=1, 2(B)=2 


where x’ = dx/dy. 


Show that the Euler-Lagrange equation for the functional S{z] is 


d 1 Y - 
dy \ x2x!? aU 


which simplifies to 


dr 2 (dx? 
—~+-|{|—] =0.. 
dy? « \ dy 
By writing this last equation in the form 


i ¢ d 

= (=) =), #(Alb=1, 218) =, 

and integrating twice, show that the required solution is 
B-A | 


show that if y becomes the independent 
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3.7 Harder Exercises 


Exercise 3.32 


The ordinary Bessel function, denoted by J, (x), may be defined to be proportional 
to the solution of the second-order differential equation 


d°y dy 
2 2 >. a _ 
which behaves as (#/2)” near the origin. See Block I, Chapter 5, 


Subsection 5.2.5. 
(a) Show that equation (3.149) is the Euler-Lagrange equation of the functional 


Ful= [ ae ey'(w? - (e- “) u(e)?], X>0, 


where the admissible functions have continuous second derivatives for 0 < x < 
X. Note that the derivation of equation (3.149) from this functional requires 
care because there is no boundary condition at « = X. This omission can be 
rectified using the fact that the differential equation is linear. 


(b) Define a new independent variable u by the equation x = f(u), where f(u) is 
invertible, and set w(u) = y(f(u)), to cast this functional into the form 


Fw) = f ‘ du axe’ (uy - (Fs — 2 ae wu)? | 
where f(uo) = 0 and f(u1) = X. 


(c) Hence show that if f(u) = e”, w(u) satisfies the equation 


Te +(e" —n*) w =0, (3.150) 


Exercise 3.33 


Show that the Euler-Lagrange equation of the functional, first considered in Exer- 
cise 2.33 (page 73) 


b 
Sty] = / dz F(x,y,y',y'), 
with the boundary conditions 


y(a)=Ai, y’(a)=Az, y(b)= Bi, y'(b) = By, 
has the first integral 


d OF OF 
7. Dy! — dy! = constant 


if the integrand does not depend explicitly upon y(a), and the first integral 


y OF da OF _ OF )_ pe oe 
y Dy ay y = constan 


if the integrand does not depend explicitly upon x. 


[Hint: the second part of this question is most easily done using the theory described 
in Section 3.5.1.] 


Solutions to Exercises in Chapter 3 


Solutions to Exercises in Chapter 3 


Solution 3.1 


(a) If x = u? the chain rule gives 


dy dy du __ dy . . ae 


dx dudx dudz/du 2u du’ 


where Y(u) = y(u7), so the functional becomes 


U 2 
tL. fay 
S|IY| = 2 d ——— —w*y? — 71/2 
LY] / we (3 (%) WwW ) where u=2/*, 


1 : 1 i2 2y72 
0 


The Euler-Lagrange equation is now 
"a i a 
— (~) +4uw*Y = 0, 
a 


which expands to uY” — Y’ + 4u°w*Y =0. With w= 1 this gives equa- 
tion (3.1), with a = 2. 


(b) The second derivative is obtained using the chain rule again, 
te af 1 i.) 1 ay ae 
dz? du \2u du ] dx/du— 2u du \ 2u du 


Se a ee ae 
~— Qu \2u du2—s- 2u2 du ] ae du )~ 


Hence y” + wy = 0 becomes 
i et red 
—~|u 
4u? du* du 


giving the same Euler-Lagrange equation for Y(u), as before. 


Solution 3.2 


(a) Here the integrand is F = y’? so OF /Oy’ = 2y’ and the Euler-Lagrange equa- 
tion is y” = 0; alternatively apply the first integral, equation (2.33) (page 49), 
to give y/? =c. 


(b) If y= G(z) the chain rule gives 


| ee 


and the functional becomes 
b 
S| = / deG (zy 2". 


Now the integrand is F = G’(z)*z'?, giving 


OF OF 

rt 2G"(z)*z' and a 2G" (z) G'(z) 2", 
so the Euler-Lagrange equation becomes 

d 


=~ (G'(z)?2') — G"(z) G'(z) 2’? =0. 


But 
d 
= (irs) ae Giers” pe aC" (2) G’(z) 7°. 
z 
and hence the Euler-Lagrange equation becomes 


Cay zl! he G"(z)G"(z)z"* ie 0, 
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which is the result quoted. 


Now make the same change of variables in the original Euler-Lagrange equation 
y’ =0. The chain rule gives 


a OU =. Joa a s dz\° 
rs a (2) and 7 = (a + G'(z) a 


which leads to the same equation as derived above, provided G’(z) 4 0. 


Solution 3.3 
(a) With the polar coordinates x = rcos6@, y = rsin9, the derivative y’(x) is given 
by equation (3.25) (page 100), hence 
ne digs 
iG == = 5: 
(r’ cos @ — rsin 6) 


Since we are assuming that y'(x) is bounded, from the definition of admissible 
functions, r’ cos@ — rsin@ # 0 on the curve. Here, for simplicity, we assume 
r’ cos? —rsin@ > 0: in the opposite case the analysis changes slightly, but the 
final result is the same. The functional becomes 


Str] / ees ee 
Jo, dO r'cos6 — rsin@’ 


where tan 6, = y(a)/a and tan = y(b)/b. But by equation (3.22), 


Oy 
=r’ cos —rsin8, hence sri= | dO rv/r2 + r!2., 
Oa 


If F =rvVr2+r’2, its derivatives are 
2 7 * 42 OF rr’ 


GT oF 
and the Euler-Lagrange equation is 

a aaa a a 

ao \ Ftart) Par 
Expanding this gives 

rr a> gee ae) Gee 

Vp? +r 82 pre 


and this reduces to r?r” — 3r7r’? — 2r* = 0, which on dividing by r?, gives the 
quoted equation. 


(b) In order to simplify this, consider the first two derivatives of 1/r%, 


a7 24 - q d? a rll ; y!? 
do \ ro) i: pot oF doz \ ro} ks: por ls \ ats 
and hence 


O25 ===. ( : | 


r a de? \ ro 


Thus if we set a = 2 and substitute for r’”(@), our equation becomes 


as 1 d? 1 
~o 5g (a) -27= 4 or ae feet: where z= —>. 


2 dl? Vr do? r= 
The general solution of the equation for z is z = r~? = Acos26+ Bsin 26, but 
since 
2 oc  eoe 
sin 20 = 2sin 8 cos6 = —" and cos 20 = cos? @ — sin? @ = ~ = ; 
r r 


this becomes A(x? — y*) + 2Bzry = 1. 


Solutions to Exercises in Chapter 3 


Solution 3.4 


The functional is 


hea ‘ as nF (=z) = i * dy! (y)F(1/2"(y)), 


which is the required result. 


Solution 3.5 
Applying the boundary conditions gives, for y;(2), 
O=A+C and 1=8B+C cosh(z/2) + Dsinh(z/2), 
and for yo(x), 
O0=-A+C and —1=-—B+Ccosh(m/2) + Dsinh(7/2). 


Since A= C' and A+C =0, we have A= C'=0. Then the equations for B and 
D become 


B+ Dsinh(7/2)=1 and —B+ Dsinh(7/2) = -1. 


Adding these two solutions gives 2D sinh(z/2) = 0, so D = 0 and hence B = 1. 
Thus the required solution is y;(x) = sinx and yo2(x) = —sinz. 


Solution 3.6 

The conditions at x = 0 give y;(0) = A+ C =0 and y2(0) = -A+C=0,so A= 
C = 0. The conditions at x = 7 give y;(7) = Dsinhz = a and yo(m) = Dsinhz = 
GB. If a # G, there is no solution. If a = (, the solution is 


sinh x Bsi = sinh x 
vc) = —Bsinz+a-— 
sinh 7 J2 sinh 7 


yi(z) = Bsinz+a for all B. 


Solution 3.7 

In this case F = yj? + ys" + yy, 80 
OF OF OF OF 
——-=—=0), —=2y,+y, and —— =24+4;. 

Oy Oys ae oe iis 


Hence the two Euler-Lagrange equations are 


d d 
2 (2y, +45) =0 and oe (2y5 + y,) = 0. 


These may be integrated directly to give 2yi + y5 =a, and 2y5 + y; = a2, where 
a, and ag are constants. Now integrate again to obtain 


2Y1 + Yo = G72 a bi and 2y2 =o Yi = AQz + bo. 


But the boundary conditions at x = 0 give b} = 1 and bg = 2; at x = 1 we have 4 = 
a, + 6b; and 5 = a2 + bg; hence a, = 3 and a2 = 3, and the solutions are 2y; + yo = 
32 +1 and 2yo + y, = 3x + 2. Multiplying the first equation by 2 and subtracting 
the second now gives y; = £ and yg = 2+ 1. 


Solution 3.8 


We can write the functional in the form 
Styn.val= [de [(v. + 406)? + 3087) 
Thus, on defining z; = y; + y2/2, this becomes 
Siz. Me) = [ dx (z\" + 495°), u@Q)—s; 210) =2, oO) =i wl) = 2. 


The associated Euler-Lagrange equations are simply z{/ = 0 and yf = 0, which can 
be integrated directly to yield z; = Aix + B, and yo = Aox + Bg. The boundary 
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conditions for z1(x) give By = 1/2 and A; + B, = 2, so A, = 3/2. For yo(x) we 
find By = 1 and Ag + By = 2, so Ap = 1. Hence the solution is 


z=$e+5 and y=2r+1, so y(t) =2(x) — 5y2(r) =a. 
Solution 3.9 


Since 


b 
Sly1, y2! =| da F (2, 94; 92, 01.86) 5 


from the definition (3.51) we require the derivative 


d 


b 
A(e) = =| dx F(x, 41 + €91, y2 + 92, y, + €9}, 4 + €95) 


b 
d 
=| dx aeh (am + €91, Y2 + €g2, Yi + €G1, Yo + €99)- 
But, the chain rule gives 

d OF OF OF OF 
ace (e yi + gi, yo + €92, 41 + €91, Yo + €95) = ae =i ays rae rl oe as Oy) ae 
Now set ¢ = 0 to obtain equation (3.50), since AS = A(0). 

Solution 3.10 


(a) In this example F(y, y’) = >y_, pee ie Aijy; — yiBijy;) giving 


= Yau + Dou _ 
bs! 


which, because A;; = Aj;, gives 


A =, feo 
= ‘IT da 
ye j=i 


Similarly, 
OF ~ 
~—=-2) Bay, 1S<k<n, 
O"k j=l 


which gives the quoted Euler-Lagrange equations. 
(b) Using the standard rules for matrix multiplication, we see that 
Tr Tr mr Te 
y Ay =) > viAgys, y By => ">> yiBizy;, 
i=1 j=1 i=1 j=1 


which gives the required functional. Similarly, 


. dy ee 
(By)k => > Busy; and (43) = 2 [see %, 


jut jel 


so the Euler-Lagrange equation is Ay’”’ + By = 0, and assuming that A is non- 
singular, so that A~' exists (as is the case for the positive-definite matrix A) 
we obtain the given equation by multiplying by Aq! 


Solutions to Exercises in Chapter 3 


Solution 3.11 
(a) If 


b 
Stun.vel =f da [5 (vi? +957) -Vir)], r=+/y? +93, 


we have 
a. ae a 
Oy}, nO dr OY Doge vo”? 


Hence the Euler-Lagrange equations are 
dy, | dV yk 
dx* ar + 


= £= 1,2. 
(b) For the functional 
b 
S\e;0| = / dx [Sr/? a re g!* —V(r)], 


we have OF /00' = r76’ and OF /00 = 0. The Euler—Lagrange equation for 6 is 
d ({ dO ae L 
r= (: =) =; ging ¢ = “2 
for some constant L. Also OF /Or’ =r’ and OF /dr = r6’* —V'(r), so the 


Euler-Lagrange equation for r is 


d?r 
Pe; + V(r) a ro!? a | 


Substituting for 6’ = Lr~? gives the required equation. 


Solution 3.12 


(a) We need to find the second derivatives of y; and y2 in terms of the derivatives 
of r and @. Since the equations relating y; and y2 to r and @ are products, this 
is most easily achieved using Leibniz’s product rule: for second derivatives this 
is 

d? // Lae 4 II 
af (aga) = fg + 2g + fa. 


Hence for y; = rcos@ we have 


d?y, dr dr d d? 

a ga 2 cosd + ras cos 0. 
But 

d do. d? a0. do \* 

= cos 6 = ap sin 0, 7) cos 6 = = a sin 6 — (=) cos 0, 
hence 

dy, dr dr dO | d20 | do \* 

a ae cos 8 — 2 da sin 0 — rT sinfd —r (=) cos 6 

= (r” — r6'*) cos@ — (r6” + 2r'6") sin 6. (3.151) 

Similarly, for yo = rsin@ we have 

wm er. an: ae 

= = qa2 sinG +27 sind +r—5 sin 0. 
But 

ole 2. &0 do\* 

7, sine re Fg C89) 772 sing tee Fan 0088 — (=) sin @, 
hence 

dy, dr | dr do d20 do\? | 

> a age NU +2 | cose + TS cos? —r (=) sin 6 


= (r” — 76’) sind + (r6” + 2r'6’) cos. (3.152) 
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The equations (3.80) are therefore 
(r” — r6’*) cos — (r0” + 2r’6’) sind + V'(r) cos é = 0, 
(r” — r6’*) sin 6 + (r0” + 2r'6’) cos + V(r) sin @ = 0. 


(b) Multiplying the first of these by cos 6, the second by sin@ and adding, gives 
pr! —rQ’*+V'(r) =0. 
Multiplying the first of these by sin@, the second by cos@ and subtracting, 
gives 
ld 


ro” + 2r’é’ =0, but a (r76") = 10" + 2’, 
r dx 


which gives the second equation. 


Solution 3.13 
Differentiating the given expressions for y1, y2 and y3 with respect to x gives 
y, =p cos —p¢'sing, yo =p'sind+p¢d'cosd, ys = 2’, 


so yi? + yh? + ys? = p’? + p?d'? + z’?. Hence the functional becomes 
b 
Slo.d.2]= dx [} (0? + 6"? +2") -V(o)), 


The three Euler-Lagrange equations for p, ¢ and z are, respectively, 


2 2 
os (2) + a=, 


dx? 


The second of these equations gives ¢’ = Lp~?, where L is a constant, and hence 
the first becomes p” — L*p~* + V'(p) =0. Hence in this coordinate system the 
Euler-Lagrange equations for p and z are uncoupled. 


Solution 3.14 

(a) Since 4(X) = y(x) the chain rule gives 
ogee oe dx j 
HE) = Kula) = = (1 +8) y'(e) 

(b) We have 


Em 
i+d 


with a similar expression for €. Hence, to first-order the expression for T is 


d= —— =d(1-6+0(8)) =d—d5+ 0(8), 


d d—dé 
T = | du F(u,9(u),9/(u)) = o du F(u,9(u),y/(u)) + O(8?). 


Splitting the integration into three integrations over the intervals (c — cé,c), 
(c,d) and (d — dé, d) gives 


d—dé 
f= / du F(u,3,7') + O(8) 
c—cod 


d Cc d 
2 / cP (0,9) 4 / ds F (alge) L / hemag FP ROw ). 
Cc c—cé d—dé 


Also, by definition, we have, 7(%) = (x) so y(a/(1+6)) = y(x) giving 
y(u) = y(ut ud) = y(u) +y'(u)ud + O(6"). 
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Differentiation with respect to u then gives 


y(u) = y'(u) + i (wy! (u)) + O(6) = y/(u) + 6 (y/(u) + uy" (u)) + O(8). 


(c) With these expressions the functional T may be expanded. The integrand 
becomes 
OF 


OF 
F(u,yt+y'd,y + (y' + uy”) 6) =F + (w/5- + (y’ + uy") Dy! 


Oy 


where F and its derivatives are evaluated at (u, y,y’). 


) 5+ 00), 


The boundary terms are 


[ ae = cF(c,y(c),y'(c)) 6 + O(8") 
and 
: du F = dF(d,y(d), y'(d)) 6 + O(6’). 
d—do 


Hence on using equation (3.111), that is the expression of the fact that the 
functional is invariant under the transformation, to order 6 we obtain 


d 
am , OF / OF a d 


(d) Integration by parts gives 


d d d 
OF OF d ( OF 
M eer = Pi cae == — =e 
I an (“5F) : “5. I anaes (57) 


Thus, equation (3.153) becomes 


OF SR Or od far 
= [(vae-*)] +L & [way ae (5p) +0 
OF 


d 
If y is a solution of the Euler-Lagrange equation, then (5 ‘pl ae 


the integral is zero, hence 


d 
iS 
c te ~ F)| =0 for allcandd. 


The required result follows. 


Cc 


(e) If F = x%y’%, the first integral is, for some constant c, 


dy A i+a 

K atl1l,/6B _ as = 

(G-aatty=e or Baz, yaa 

If y = 1 the solution is y = Alnx + B: otherwise it is y(z) = Av’~7/(1—y) + 
B. 


It is necessary that the functional 


d 
s= | dx x%y!? 


be invariant under the scaling x = (1+ 6)z. If we set z = Inz and Z = InZ, we 
see that z = Z+In(1 +), so with this independent variable the scale trans- 
formation becomes a translation, which is the case dealt with in the text. 
Transforming the independent variable to z gives 


d 6B 
a dy dy «. dy dz 1 dy 
= (lta B)z —_— —S>S= = —————S- 
[ cs @ da ded e# dz’ 


In order for the functonal to be translationally invariant this must not depend 
explicitly upon z and hence we need 1+ a = £, that is y = 1. 
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Solution 3.15 


Express the integrand of the left-hand integral in terms of f(x) and g(x) using the 
given relations. We have 


f (x)? = f'(x)? cos? a + g'(x)* sin? a — 2f'(x)g'(x) sinacosa, 


g (x) = f'(x)? sin? a + g’ (x)? cos? a + 2f’(x)g' (x) sin acos a. 
Since cos? a + sin? a = 1 this gives 
F(a)? +9 (a)? = f'(2)? +9'(2)? 


and hence the two results. 


Solution 3.16 


All the given transformations depend continuously upon only one parameter, 6. 
Transformations (a) and (c) reduce to the identity when 6 = 0, but (b) gives (%, 7) = 
(x,—y). Hence (a) and (c) define a one-parameter family of transformations, but 
(b) does not. 


Solution 3.17 


(a) Separating variables puts the equation in the form 


fas = [at z= y(0). 


The integrand of the left-hand side may be decomposed into partial fractions 
1 1 1 


wl-@) @- i=« 


so integration gives 


nS) = 


and rearranging gives 


(b) There is one parameter, t, and at t = 0, we have y = z, which is clearly correct 
because this is just the initial condition. 


Solution 3.18 


In this case 


But 
dy, dy dy2 . dx dY> dy : dyo dx 
—— = {| —coshd + — sinhéd ]) — —< = | — — = 
= ( a cosh 6 + Ta sin c= and = rs sinh 0 + cs cosh 6 a 

SO 

Hence 


r= [wis (a) [(ie) -(@) 
- «rare (@) -(2) | 


Hence T = T only if g'(x) = 0, that is, if g(x) is a constant. 
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Solution 3.19 


The Taylor expansions of the trigonometric functions are, 


a? a (—1)"a?" - 
COS Q = ae er ee ;, 
a? a? (—1)"q2rt1 ‘ 
a a ie 


Hence, to O(a), from transformation (3.115) we have ¥, = y1 — aye, Jo = ay + yo, 
which can be written in the form Y,; = y1 + aw,, Yo = y2 + ayo, where W, = —ye2 
and Y= 1. 

Solution 3.20 


In this case ¢ = 0, ~, = —y2 and 2 = yi so equation (3.133) becomes 


OF OF ; / 
—5yr fe Bi = 2y1Y> — 2y2y, = constant. 
fi 2 


Solution 3.21 


Consider the general transformation 


1=Y1 + 09; (2), Yo = Yo: + dgo(x), z= x + d93(x), 


so that 
dy, dy / dz dYy dy2 dx 
CF CA 5 O92 C22 1 se 
= (‘P+ (2) ) eae = ag * 92() } ae 


becomes, to first-order in 0, 


ore 1 dy, \"  (dy2\* ee eee 
T= | pk Pe PTE (#) +() +28 (of (2) G2 + o4(e)  ) : 


Now consider the three specific cases: 


(a) If g1 = g and gz = g3 = O, then 


d 
- dy 
- 2 dx g'(x)—. 
i= t+ s/f ey (x) ms 


(b) If go = g and g; = g3 = 0, then 


d 
on dy2 
T=T+206 fee (2) ——: 
+ / rg (x) 


(c) If gg = g and g; = gz = 0, then 
d 2 2 
ma i dy dys 
T= ——_____ | | —— a ; 
I Ty bya) (2) aes, | 


In all cases T = T only if g(x) = 0, that is, if g(x) is a constant, which we set to 
unity in the following. The resulting first integrals are found as follows: 


(a) ow, =1, ¥% =¢=0, hence s = constant, giving y;, = constant. 
1 
b) ww =1, v0, =¢=0, hence 24 = constant, giving y, = constant. 
2 1 ys 2 


(c) @g=1, ¥, =. = 0, hence F — Uh our _ Yd Bur = constant, 


giving y}? + ys? = constant. 


In this example the first integral arising from the invariance with respect to trans- 
lations in x can also be derived from the two first integrals due to the invariance 
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under the translations in y; and ya. Thus not all symmetries lead to new first 
integrals. 


Solution 3.22 


In this case, since % = x, we have 


= [a E (2) : (®) ) +VQy -%) 


Replacing Y; by yx + 6g(a) and expanding to first-order in 6 gives 


r= [uf (2) (BY) +90 (+88) 70m] 


Thus T = T only if g = constant. In this case equation (3.133) becomes, on setting 
OF OF 
ro 
Oy, ys 


=constant, that is, y; + ys = constant. 


Solution 3.23 


In this example 7 = a =x —6x+O(6") and 7 = y. From equation (3.131) 
@ = —x, and from equation (3.133) 


OF 
£ (F — V5-) = constant. 
Y 


Solution 3.24 

Les ., 
dy _dydz 1 _dy 
dx dzdxr  cze—! dz’ 


so the functional becomes 


‘ 1 1 12 > 9 1 : y!? -—. :.2 
Sy) =e f oe a (saa — uty) =* [as ( - eatery) 


where @ = a!/¢ and b = b!/¢. The factor 1 /c, external to the integral, is ignored in 
the following analysis. 


The Euler-Lagrange equation for this functional is derived from the relations 


OF 2y/ OF 

= = ee Seu ety. 
and is 

af 29 

iz (gar) +27 y= 0 


which expands to 


a ee 
Se tH Cr =O. 
yet ag? a az 3 “ 
Multiply by z° to obtain the required equation. This is the transformed form of 
the differential equation y” + w*y = 0, because the latter is the Euler-Lagrange 


equation corresponding to the original functional. 
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Solution 3.25 


The functional depends only upon y'/, and y5, so the Euler-Lagrange equations are 


d (OF d -( OF 
= (ar) 7° a = (az) 7° 


Expanding these equations gives 


a ae og a OF O° F 
a 2: A intone es 
aye By ay,” a” By ouht * Byhe 
If d # 0, the only solution of these equations is y// = 0 and y!/ = 0, and integration 
gives the family of straight lines y;(z) = Ax + B and y2(x) = Cx+ D. 


If d = 0, the two linear equations for y// and y¥ are the same. In this case we use the 
original equation, which integrates to Py (yi, ¥5) =c1, where c; is a constant. The 
other equation, Fy (1, y4) = C2, is the same because d = 0. Now there is insufficient 
information to find y; (a) and y2(x): all we know is that their derivatives are related. 


Consider the simple example, 


b 
Sine / eg. er 


for which d = 0, and the Euler-Lagrange equations are both y// + y4 = 0, so that 
yi + yo = Ax + B, for some constants A and B: that is, only the sum is known. 
Another way of seeing this is to introduce two new variables u = y; + y2 and v = 


b 

Y1 — y2 so the functional, S = / dzu’*, depends only upon u and u = Ar + B, 
a 

but v(az) is undetermined. 


If there is only one dependent variable, the Euler-Lagrange equation is 

ad for oF 

oe foe te, OF y = 

dx \ Oy’ Oy’? 
If OF /Oy' = constant, there is no unique solution for y(x); thus the equivalent of 
d#0 is that OF /Oy’ is not a constant. 


Solution 3.26 


Observe that 


i Rs = / 
dx Oz By, Yo; 


Oy2 
Salyi, y2| =| 


= Sy[y1, y2] + [®(a, yr, y2)|2 0 . 


SO 
b 


d® 
dx |F ‘a 
| (Ya Y2,Y1»Y2) + 


The boundary term is independent of the path, so the stationary paths of the two 
functionals, S; and S92, are identical and have the same Euler—Lagrange equations. 


Solution 3.27 
Replace each of the terms gz(x)y;,, k = 1, 2, in the original functional by 


d 


to obtain Sg, with the addition of the terms (gz(x)yx(x))’ in the integrand. Then 
apply the result of Exercise 3.26, with ® = —g,y1 — goye. 
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Solution 3.28 


(a) The required derivatives of the integrand are 


OF > e* OF 22 j 


ay Wynew Oy Wy ew 


The Euler-Lagrange equation is therefore 


d 1 ik 
dax (a = ~7 | " Q/y — ery ; 
On expanding this expression we obtain 
ery! + ety! —y! en a 
Aty—ety!)8i2 hy ety’ 
Rearranging this gives y” — (3e~* — 1) y! + 2e~*7y = 0. 
(b) Ifu =e -*, x = —Inu, and Y(u) = y(2(u)), we have, using the chain rule 
dy _ dY du ee 7 dy dY 


=-—e ~“—, hence e*— = —— =-—Y 


dx du dz du dx du 


When « = 0, u = 1 and when ¢ = co, w= 0, so for any function f(x) 


[ aete) = [a = F((u) = 7 du— f(~ Inu). 


Thus the functional becomes 
o [ du JY (u) + Y"(u). 


in this.case F=f Y + and OF /0¥ = oF/0r' = 


Euler-Lagrange equation is 


S(Y +Y’)~1/2, and the 


d z 1 
ce ee eee ee) ge OY" a)” 2 oY = 0. 
du ) 2/Y +Y’ 


(c) Ifa =-—lInu, 
ay _ dy dx oo 


du dx du ae 


and similarly 
crs. o d oo YY 
duz da \" dex 
so equation (3.148) for y(a~) = Y(u(a)) becomes y” — (3e~* — 1) y’ + 2e777y = 0. 


Solution 3.29 
if F = y'* + 2* + 2ye, then 


OF OF OF OF 

rar tee : — =2 d — =2 : —— <2 

Oy’! sd Oy ae ned’: . 
so the Euler-Lagrange equations are 

d*y dz 

a eal and 7 


Putting z = y” into the second equation gives d*y/dx* —y =0. Now put y = 
aexp(Ax), where a and X are constants, to obtain \* = 1, that is \ = +1 and +i. 
Thus the general solution is 


y= Acosxz + Bsinz + Ccoshz + Dsinhg, 
and 


z=y" =—Acoszx — Bsinz + Ccoshz+ Dsinhz. 
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The boundary conditions at x = 0 give A+C =OandC—A=0, hence A=C=0. 
The boundary conditions at « = 7/2 then give 


2 = B+Dsinh(r/2) and 4 =—B+ Dsinh(z/2). 


Subtracting and adding these equations gives B = 1/2, and 2Dsinh(z/2) = 2. 
Hence 
sinh x sinh x 1 


+2% ' 
— ———__ + = d ae ay | 
sinh(7/2) ee sinh(7/2) acini 


Solution 3.30 


Using the expression for y’ derived at the begining of the solution of Exercise 3.2(b), 
the functional becomes 


b 
Sly] =| dz F(x, G(z), G' (ee a wheres = 
: £ 
Since 
OF OF dy _,, OF OF OF dy OF dy! OF, .. OF 


G"(z) + —G"(z)2z’, 


——s SS SS , —— —S. «= So —— = — 
Or of oe ar waa: Oy Oz Oy! Oz Oy Oy’ 


the Euler-Lagrange equation becomes 


d er OF 2 | as 
eo (c oe G'(z) Gas == 


Solution 3.31 


(a) The functional is 


B B 
Sta] = f ay aan f wou Aja, xB) = 2. 
(b) Putting F = 1/(x?z'), we have 
OF 1 OF 2 
Oa? (wa? °° On ~ Ba” 


so the Euler-Lagrange equation is 


Se hee 
day. (arse * aq! 


d a 2 
dy \ (xa')? 2a 3 By!” 


which gives the required equation. 


But 


(c) Integrating once gives x*x’ = c for some constant c. Integrating again gives 


x? /3 = cy +d. The boundary conditions give 


5 =cB+d, 


3 =cA+d and 
giving 

a tpt =8A4 

= B-A 
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Solution 3.32 


(a) This problem is slightly different from all previous examples because there are 
no boundary conditions prescribed: this means that some care is needed. If 
y(x) is a stationary path and y(x) + €g(a) a varied path, the standard analysis 
gives 


ar=2/ Pe ey wo") - (2) u(ea(o)) 


and integration by parts gives 


x = d n? 
AF = 2[zy'(x)g(x)\, — 2 | dx g(x) (ay) 4. (« — =| u(e) 

0 L XL 
The boundary term vanishes at « = 0 because y’(2) is finite here. However it 
does not vanish at x = X. We solve this problem by imposing the boundary 
condition y(X) = Y 4 0 on the admissible functions, so now g(X ) = 0 and the 
boundary term vanishes. Hence the Euler-Lagrange equation is 


d n? 
ay ty) + (: a =| y(x) = 0, 
or 
a d 
oe (x* —n7)y =0. 


This is a linear equation; the solution is therefore undetermined to within a 
multiplicative constant, so the value of Y, provided it is not zero, is irrelevant. 


(b) If « = f(u), then the chain rule gives 
dy _dydu _ w'(w) 


dx dudx  f'(u)’ 


and the functional becomes 


Plu] = f° du pw) | Fw)? = (su) - 2) w(u?| 


Fu? ’ 


where f(wo) =0 and f(uy) = X. 


(c) If f =e” this functional becomes 


Fiat = / du |w'(u)? — (e —n?) w(u)?], where uy =InX. 
The Euler-Lagrange equation for this functional is w”(u) + (e?“ — n?) w = 0, 
and this has a solution w(u) = J,(e"). 


Solution 3.33 


In the first case, if OF /Oy = 0 the differential equation derived in Exercise 2.33 can 
be written as 


d (4 (OF) _ OF) _ 4 here 2 (2F)_ OF _ octane 
iG ae oy oe ha ence da \ By" Dy = constant. 


In the second case, if OF'/Ox = 0 it is easiest to use a version of the analysis described 
in Section 3.5.1. Equation (3.93) becomes 


d—6o 

T= f  duFGu).¥(w).7"w) 
d Cc d 

= / du F(y,y',y”) + / duF(¥,y',y”) - / duF(y,y',y"). 
c c=6 d—6 
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On putting u =v +6 and expanding to first-order in 6, we obtain 


d 
O OF OF 
= i 3 yl! MI 
T= | dv Pu!) +05" y + oa GY + 65 iY 


and because G' is invariant under x-translations, this gives 


— 6[F(y,y',y" Io + O18), 


d 
a OF OF yl OF i. ao r yid 
a= | dv Ea + — Dy 7 is ay | IFiy.y,y I. + O(6), (3.154) 


which is the equivalent of equation (3.103). Now integrate by parts: 
d d d 
OF OF d (OF 
d soa = oe, = / d ee tae 
; Oy : oa ie | Ga 
d d d 2 
OF OF d (OF d OF 
d /1/ — // — pes d Sinan euiaiales 
I ae : ay" de ar) +f oO de? (ar) 
Hence equation (3.154) becomes 


,OF ,d (OF OF 2 far gd for. far 
2h ee eet eee aS eee Bee oe 38 
aa ae) Lobe G-5(S)+3 (577) = 08 


But the integrand is zero and the end points c and d are arbitrary, hence 


y OF a OF oF ’_ F = constant 
7 Oy" \ da \ dy") ~ dy ) ¥ ; | 


vanes cated = : ‘f 5 ms. 


“i 
‘ ae 
_ = : me 
_ va - a 
a —— a 7 
A SP a 9 
| "(eee #). 


a Pa - A = 

ae a owe 

ie OT cae . 2 -as 7 aa 
ae _ 1 ses: 


| -wwaqert> at zaztoiend of anctruto? 


ainido aw A tas tet od sath | baw t+ee ies 
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CHAPTER 4 
Newtonian dynamics 


4.1 Introduction 


The principal aims of this chapter are to formulate Newton’s equations of 
motion as a variational principle, to derive from this Lagrange’s form of these 
equations, and to show how these may be used to simplify the derivation of 
the equations of motion for certain types of mechanical systems. 


You may be wondering why Newton’s laws, as formulated in previous courses, 
need reformulating: there are several reasons and, in no particular order of 
significance, the main ones are listed below. 


(1) A variational formulation of the equations of motion is more elegant and 
leads to powerful methods of finding and understanding solutions of the 
equations of motion; see point (3). 

(2) The application of Newton’s equations to even quite simple systems is 
awkward and tedious. This difficulty was overcome by Lagrange who, 
building upon the work of James Bernoulli and D’Alembert, reformu- 
lated Newton’s equations of motion in terms of generalised coordinates 
—a term that will be defined in Section 4.3.1. 


The complete transformation to the modern formulation was made by 
the Irish mathematician Hamilton in 1827 with the introduction of 
the Hamiltonian function — although an important element of mod- 
ern physics this is not part of the present course. He also described 
solutions of Newton’s equations in geometric terms using what is now 
named Hamilton’s principle, from which Lagrange’s equations of motion 
can be derived. The main aspects of Hamilton’s principle (which does 
not involve Hamilton’s function) will be described in Section 4.4. 


It is worth noting here that towards the end of the 19th century, as 
physics progressed to the study of atomic particles, the classical mechan- 
ics of Newton was found to fail. Then Hamilton’s ideas were important 
in the development of the new quantum mechanics that describes the 
dynamics of atoms and molecules. 


(3) By expressing Lagrange’s equations of motion as a variational princi- 
ple, the underlying mathematical structure of the equations of motion 
is clarified. This leads to the development of techniques and methods 
that help understand the behaviour of solutions and also help with their 
computation. 


Perhaps the most dramatic example of this is Poincaré’s discovery of 
chaos in 1898, made by extending the geometric ideas first formulated 
by Hamilton. 


Joseph-Louis Lagrange, 
1736-1813, 


James Bernoulli, 
1654-1705. 


Jean le Rond d’Alembert, 
1717-1783, was an 
illegitimate child, 
abandoned on the steps of 
the church of St Jean le 
Rond, from which his 
forenames came. This 
church, originally adjacent 
to the north wall of Notre 
Dame Cathedral, was 
demolished between 1742 
and 1765. 


Sir William Rowan 
Hamilton, 1805-1865. 


Jules Henri Poincaré, 
1854-1912. 
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(4) A variational principle formulation of the equations of motion is inde- 
pendent of any particular coordinate system, because of the invariance 
of the Euler-Lagrange equations discussed in Section 3.2. This feature 
provides a route to powerful methods of approximation, to the more 
general theories of special and general relativity, and is helpful in the 
formulation of the wave equations that describe electromagnetic radia- 
tion. These topics are beyond the scope of this course, though a wave 
equation will be derived in Subsection 4.5.4, in order to show how the 
first and last parts of this course are connected. 


In Section 4.2 we quote Newton’s laws and derive Newton’s equations of 
motion for three typical situations: these three examples are chosen to il- 
lustrate some of the technical problems that occur, and which are overcome 
by Lagrange’s formulation, described in Section 4.4. The material contained 
in Section 4.2 covers the same ground as previous level 2 courses: it is not 
assessed in this course, but you need to be familiar with the ideas in or- 
der to understand later sections. In Section 4.3 we discuss Newton’s laws 
and introduce the important ideas of generalised coordinates and kinetic 
and potential energy, all of which are essential to Lagrange’s formulation of 
Newton’s equations of motion. 


Lagrange’s equations and Hamilton’s principle are described in Section 4.4. 
In the final section before the end of chapter exercises we apply Lagrange’s 
method to a variety of problems, ending with the derivation of the wave 
equation, which extends the theory to dynamical systems of infinitely many 
interacting particles and relates the theory of this section to that developed 
in Block 1, Chapter 1. 


The assessment questions based on this chapter will be on material contained 
in Sections 4.4 and 4.5, though an understanding of this work is possible only 
if you are familiar with the ideas discussed in earlier sections. None of the 
historical material is assessed, and may be omitted. 


4.1.1 On nomenclature 


Newtonian dynamics describes the motion of physical objects. In order to 
obtain manageable equations, we idealise these physical objects by math- 
ematical constructs, and also make various kinds of approximations. The 
link between the mathematics and the physical world is by experiment, often 
overlooked but one of the crucial methods of validating our approximations. 


The primary concepts of Newtonian dynamics are mass, length and time; 
here we assume the intuitive notion of these; you should be aware, however, 
that these concepts are more difficult to understand than first appearances 
suggest. 


The basic object of dynamics is a particle, a mathematical object with mass, 
but no volume or structure. Particles are used to approximate a wide variety 
of physical objects, for example, electrons and stars. It is a useful approx- 
imation where the physical size of objects is small by comparison with the 
distance between them, and where their internal structure is assumed to 
play an insignificant role in their relative motion. 


An example is the relative motion of the Earth and the Sun, which for 
many purposes is accurately described using two particles, because their 
radii, respectively 4 x 10° and 4 x 10° miles, are far less than the mean 
distance between them, 9.3 x 10’ miles, and over this distance the physical 
structure and slight departures from spherical symmetry of each does not 
significantly affect the relative motion, except over very long times. If, on 
the other hand, the rotation of the Earth is of interest, then it needs to 
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be treated as a body with structure. As usual the approximation required 
depends upon the questions being asked, but it is often difficult to know a 
priori which approximations should be made. 


The next object of dynamics is the rigid body, which is used to approximate 
a physical object that does not deform significantly under the forces present; 
for instance a steel ball bearing can often be treated as rigid. The distinction 
between a rigid body and a particle is that the former has an orientation in 
space, so it can rotate and have angular momentum; the spinning Earth is 
an example. In this chapter we do not deal with rigid bodies, nevertheless 
we shall use the terms body and particle freely. 


The collection of objects being considered is often referred to as the system 
and its mathematical description is named a dynamical system: sometimes 
the distinction between these entities is blurred. The motion of a dynamical 
system is described by differential equations, generally referred to as the 
equations of motion. 


A number of other common English adjectives are used in a more precise 
manner than in common parlance, but such precision is essential for our 
simplified approximations of reality. Those used freely in this chapter are 
listed below. 


Light: For example, ‘a light rod’ — this means that the mathematical object 
has zero mass, and is useful for approximating a physical object with a mass 
sufficiently small that we can assume it to have a negligible effect on the 
motion. The opposite of light is heavy, which means that the mass cannot 
be ignored. 


Inextensible: This term is used to describe rods or strings, for instance — 
it means that the length of the mathematical object does not change and is 
used when the forces involved do not significantly change the dimensions of 
the physical object. 


Rigid: For example, a ‘rigid rod’ — it means that the mathematical object 
does not bend and is used when the physical object bends little. A synonym 
is stiff. A rigid body can be constructed by joining two or more particles 
with light, inextensible, stiff rods. A rigid body need not be inextensible, 
but it must not bend. 


Smooth: This is a particularly useful term meaning without friction; syn- 
onyms are free and frictionless. The opposite of smooth is rough, and for 
such surfaces friction cannot be ignored. 


Dot notation: In this chapter we shall use the notation introduced by 
Newton whereby the first and second derivatives of a function of time, f(t), 
are represented by f and f respectively, that is, f = df /dt and f = d? f /dt?. 
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4.2 Newton’s laws 


In this section we use Newton’s laws to derive the equations of motion for 
some simple systems. It is assumed that you are familiar with Newton’s 
laws, and have used them to derive equations for some simple dynamical 
systems. ‘he conventional statements of Newton’s three laws are as follows. 


Law I Every body continues in its state of rest, or moves with constant 
velocity in a straight line, unless a force is applied to it. 


Law II ‘The change in motion is proportional to the force acting and takes 
place in the direction of the straight line along which the force acts. 


Law III For every action (that is force) there is always an equal and op- 
posite reaction: or, the forces two bodies exert on each other are 
always equal and opposite directions. 


You may be familiar with a different version of the second law in which 
“change in motion” is replaced by “rate of change of velocity”. Newton 
used the former because he experimented with impulses; see Exercise 4.30 
(page 185). The version given here describes his observations more appro- 
priately and includes continuously varying forces. 


These laws provide an accurate description of a wide variety of physical 
phenomena. The mathematical expression of the second law is normally 
called Newton’s equation of motion. These laws are, however, awkward to 
apply in all but the simplest situations, and the mathematical structure 
of the resulting differential equations is not usually clear. The Lagrangian 
formulation of these principles overcomes these problems. 


In the following three subsections we apply Newton’s laws to a variety of 
examples in order to demonstrate some of these problems. We start with 
some quite simple examples and end by deriving complicated equations of 
motion for the double pendulum. It must be emphasised that this material 
is not assessed, but provides essential background that you should under- 
stand. Moreover, we shall use the examples provided here to illustrate the 
more general discussion that follows in Section 4.3, and which leads to the 
statement of Hamilton’s principle in Section 4.4. Although this material is 
not assessed we advise you to do as many of the exercises as time permits. 


4.2.1 Particle moving on a parabolic wire 


In this example a bead is constrained to slide under gravity on a smooth 
rigid wire in the shape of the parabola y = ax*/2, a > 0, where the y-axis 
is vertically upwards, as shown in Figure 4.1. 


4.2 Newton’s laws 


Figure 4.1 Diagram showing a bead of mass m at a point P on a wire in the shape 
of the parabola, y = ax?/2, for x > 0. The tangent through P makes an angle 6 
with the x-axis. 


In the first instance we shall suppose the wire to be stationary, then we shall 
allow it to move bodily in the y-direction in a prescribed manner. In both 
cases the position of the bead can be defined by one coordinate, and the 
system is said to have one degree of freedom. Here we shall take this to be 
the x-coordinate, but others could be used; for instance we could use the 
y-coordinate or the distance along the curve from any given point. 


In order to use Newton’s equations directly we introduce a force of con- 
straint which, in this case, is a reaction normal to the tangent at P and of 
magnitude R, which is necessary to keep the bead on the wire. The reaction 
is perpendicular to the wire because we assume that there is no friction, so 
there is no force between the bead and the wire along the tangent. If the 
gradient of the wire at P is tan @, as shown in Figure 4.1, then the force 
of reaction makes an angle @ to the vertical through P. Thus the z- and 
y-components of Newton’s equations of motion are, respectively, 


mz=-—Rsin@ and mij = Rcosé? — mg. (4.1) 


Now we need to eliminate the force of reaction; this is achieved by multiply- 
ing the first equation by cos @, the second by sin@ and adding to give 


m(z%cosé@ + ysiné) = —mgsiné. (4.2) 
Now divide by mcos@ to give 
£+ytané = —gtané. (4.3) 


But x and y are related because the bead is constrained to the wire, that is, 
y = ax*/2, and the chain rule gives 


y=ark and y=ari+ai’. (4.4) 
Also we have tan@ = y'(x) = ax. Thus equation (4.3) becomes 
#(1+a°x*) + a°xa* + agz = 0. (4.5) 


Later, we shall see that this is Lagrange’s equation, and we shall learn how 
it may be derived more simply without introducing the reaction force. 
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Exercise 4.| 
By differentiating the quantity 
E = 3mi* (1+ oe) + imgaz? (4.6) 


with respect to ¢ and using the equation of motion (4.5), show that E is a positive 
constant. Hence show that the equation of motion can be integrated once to yield 
the first order equation 


d 2K — 2 2E 
So ee - gee et 
dt m(1 + a?x?) 


mga 


We shall see later that the quantity FE’, defined in equation (4.6), is the total 
energy of this system, and is a constant on all solutions of the equations of 
motion, with a value that depends upon the initial conditions. 


A vertically moving wire 


Now suppose that the wire is moving vertically with given displacement 4(t) 
from a fixed point at time t. We can represent this motion by allowing the 
coordinate system Oxy to move, so that the height of the x-axis from a fixed 
horizontal line is y(t). Then the acceleration in the y-direction relative to 
this line is y + 7, so this replaces y in equation (4.1), which then becomes 


mz=—Rsin@ and my = Rcosé—m(g+%). (4.7) 


Thus the effect of moving the wire is the same as modifying the gravitational 
force. On using the constraint, y = ax?/2, we obtain, the equation of motion, 


#(1+a°2*) +a°rt? +a(g+%4)2=0. (4.8) 


If the wire is moving uniformly, that is 7 =constant, then this equation 
is identical to the original equation. Otherwise the energy, FE, defined by 
equation (4.6), is not a constant, as shown in Exercise 4.2; it can be shown 
that there is no equivalent constant quantity. 


This example may seem rather artificial; however, it illustrates a very com- 
mon circumstance. For instance, if we ride in an aircraft through turbulent 
air or in a vehicle over rough ground, the coordinate system Oxy is fixed in 
the vehicle and the additional ‘force’ — the term ma7 in equation (4.8) — is 
the force felt by a passenger of mass ™. 


Exercise 4.2 


Show that when the wire is moving, the quantity EL, defined in equation (4.6), varies 


as follows 
dE ae 9% 


In this example the elimination of the reaction R was achieved relatively 
painlessly. In other circumstances the equivalent analysis is more difficult 
and the purpose of Lagrange’s theory, developed later, is to make this process 
easier. 


4.2 Newton’s laws 


4.2.2 The simple pendulum 


The simple pendulum — also known as the vertical pendulum — is one of 
the simplest dynamical systems and has attracted the attention of numer- 
ous philosophers, although Galileo discovered practically all of its major 
properties. Until the early 20th century it provided the primary method of 
accurately measuring time. Even today the dynamics of the pendulum is 
important in the study of more complicated nonlinear systems and in some 
aspects of molecular dynamics. Some of these aspects of the pendulum are 
discussed further in Subsection 4.5.1. 


Our idealised pendulum comprises a heavy particle, P, of mass m, firmly 
attached to a light, stiff, inextensible rod of length J. The other end of the 
rod is attached to a fixed point, O, about which it can swing freely, that is, 
without friction and air resistance. Such a system is shown schematically in 
Figure 4.2. 


- 
mg 


Figure 4.2 Diagram showing the idealised pendulum of length /, with smooth 
hinge at O and particle of mass m at P. The coordinate axes are shown with Oy 
vertically upwards. 


We shall assume that the rod OP can rotate about the pivot O, so there are 
no constraints on the value of @, the angle between OP and the downward 
vertical. A bicycle wheel with a suitable weight attached to a spoke and with 
the axle held firmly is an example of a mathematically equivalent system. 
The configuration of this system is specified uniquely by the single variable 
6, and we expect Newton’s equation of motion to depend only upon this 
angle and its derivatives. In this example the configurations corresponding 
to 6+ 2nz, for integer values of n, are physically equivalent, so we expect 
the equations of motion to depend periodically upon @. 


The simplest approach is to use the conservation of energy. However, this 
method cannot be used for more complicated systems, so here we apply 
Newton’s laws directly using a widely applicable method. 


There are two forces acting upon the mass P: 
(a) the external downward force of gravity, with magnitude mg; 


(b) the tension, of magnitude T, which acts along the line OP and constrains 
the distance OP to be constant; the tension also acts on the hinge at O, 
but a frictionless hinge is unaffected by the tension, and the tension has 
no effect on the motion of the mass, other than to constrain it to move 
on a circle. 


The mass P moves in the plane of the paper. We define a coordinate system 
with origin at the point of support, O, with the y-axis vertically upwards 
and the z-axis as shown in Figure 4.2 and describe the position of P using 
Cartesian coordinates, (x,y), which are 


x=Ilsin@d and y=-—lIcos@. (4.9) 
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The force acting on P is the vector sum of the tension, having Cartesian 
coordinates (—T sin @,T cos @), and the gravitational force (0, mg). Hence 
the two components of Newton’s equations of motion are 


mz=-—Tsind and mi =Tcosé@— mg. (4.10) 


Thus we have two equations and three unknowns, x, y and J’. However, 
because the length OP is fixed, x and y are related with x? + y* = 17. The 
tension TJ’ may be eliminated from equations (4.10) by multiplying the equa- 
tion for « by cos@, the equation for y by sin@ and adding the resulting 
equations, to obtain 


m(x%cosé + ysiné) = —mgsiné. (4.11) 


But, from the geometric relations of equation (4.9) we have, on differentiat- 
ing each twice, 


= 16 cos 0, z= (cos — 6? sin 8) 


| ' | (4.12) 
=o g=3 (sino 4 6? cos 8) | 
so ¢cos@ + ¥sin@ = 16, and equation (4.11) becomes 
d?0 g. 
dt = ~% sin 6. (4.13) 


This is the required equation for #: it is independent of the mass m and 
depends upon / and g only via the ratio g/l. Notice that, as suggested 
above, this equation of motion depends periodically upon @. It can be shown 
that the solutions of this differential equation cannot be expressed as finite 
combinations of elementary functions. 


You should note that the main objective of the analysis between equa- 
tions (4.9) and (4.13) is the removal of the tension T. In Section 4.5 we 
shall see how to derive the equations of motion without introducing the 
tension. 


In this chapter, we shall not usually concern ourselves with the solutions of 
the equations of motion, but this problem is a little special so we note that 
if |0| is small, sin? ~ 6 and an approximation to the equation of motion is 


d+uw79=0, w= /$ (4.14) 


which has the general solution #(t) = Asin(wt + a), where the amplitude, 
A, and the phase, a, are constants, depending upon the initial conditions. 


Note that 0(t) = Asin(wt + a) is an exact solution of an approximate equa- 
tion, and it is not a priori obvious that it is also an approximate solution 
of the exact equation. In this case the solution of the exact equation (4.13) 
is known so we know that for sufficiently small times, the solution of the 
approximate equation is close to this exact solution. 


Thus the motion is periodic with period 27/w, which is independent of the 
amplitude, so the system is said to be itsochronous, the property that makes 
the pendulum so useful as a time-keeper. The isochrony of the pendulum 
was first explicitly stated by Galileo (in a letter dated 1602), who at the 
same time noted that the period was independent of the mass. That the 
period is proportional to the square root of the length, 1, was later discovered 
by Galileo, and published in 1632. If the amplitude of the motion increases 
too much, the approximation sin@ ~ @ breaks down and the period of the 
motion then depends upon the amplitude, see Exercise 4.29 (page 184). 


4.2 Newton’s laws 


A moving point of support 


We now consider a variant of the simple pendulum in which the hinge is not 
rigidly attached to a stationary point, but is fixed to a heavy particle, of 
mass J/, that can slide smoothly along a straight horizontal rail, as shown 
in Figure 4.3. In this example we use a coordinate system fixed relative 
to the rail, with the y-axis vertically upwards and Oz along the rail. The 
resulting analysis is easier than if we used a moving origin at A. 


Figure 4.3 Schematic diagram of a simple pendulum with its hinge attached, at A, 
to a particle of mass M that can slide freely on a horizontal rail along the x-axis. 
As in Figure 4.2, the rod AP is rigid, light, inextensible with length /, has a 
particle of mass m firmly attached at P and can swing freely at A. 


The reason for treating this system is to illustrate how to analyse a system 
that requires two variables to define its configuration. As before, there is the 
angle # between the pendulum and the downward vertical: in addition there 
is the distance, X, between the particle at A and a given stationary point, 
O, on the rail. The tension in the pendulum now acts on A and causes it to 
accelerate. 


Using the coordinate system Oxy shown in the figure, we see that the Carte- 
sian coordinates of P are (X +/sin 6, —l cos @), as in equation (4.9), but with 
the distance X added to the x-coordinate. ‘The equation of motion of the 
mass at A is 


MX =Tsin6@, (4.15) 
and the equations of motion for the mass at P are 
d? ae 
i =-—Tsin@ and | Sa3 = T cos™ — mg. (4.16) 


These are identical to the equations of the fixed pendulum, equations (4.10); 
however, this motion is affected by the motion of the mass at A through the 
tension T. Comparing the equations for 7 and X, we see that MX + ma = 0, 
that is MX + mé = constant; the momentum in the z-direction is constant, 
which is just Newton’s first law applied to the centre of mass of the system 
— we expand upon these ideas later, so do not be concerned if they are 
unfamiliar. 


We now need to eliminate the tension from equations (4.16), but because 
x = X +lsin@, the analysis is not as simple as before. Equation (4.11) is 
still valid, but now differentiation gives 


¢ = X +16 cos8, = X +1 (Gcosd — 6 sind), (4.17) 


y = 16 sind, 7=1 (sind + 6° cos 8) (4.18) 
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so cos? + Hsind = X cos6 +16 and the equation of motion (4.13) is re- 


placed by 

X cosé + 16 = —gsin@. (4.19) 
Now we use the relation mi = —MX with second of equations (4.17) to 
obtain 

X =—-pl (4 cos 6 — 6? sin 0) y = gras (4.20) 

M+m 

By substituting this into equation (4.19) and rearranging we obtain 

(1 — cos? 0) 10 + 16” sin 6 cos 6 + gsin@ = 0. (4.21) 


Notice that the equation for 6 does not depend upon the other variable, X: 
there is a good reason for this which will be discussed later, in Section 4.5.1. 


The equation of motion (4.21) looks quite complicated. In fact, this equa- 
tion can be reduced to a simpler first-order equation, but we leave this 
simplification until Section 4.5.1, specifically equation (4.107) (page 186). 


Exercise 4.3 


Show that if M is sufficiently large, so that 4: can be approximated by zero, equa- 
tion (4.21) reduces to the original pendulum equation, equation (4.13), and the 
equation for X, becomes X = 0. Explain the meaning of the latter equation. 
Exercise 4.4 
Show that the quantity 

E=}(M+m)X?+ iml?6? + ml@X cos @ — mgl cos 6 


is a constant of the motion. [Hint: compute E and use the equations of motion, 
4.19 and (4.20), to show that this is zero.| 


4.2.3 The double pendulum 


The double pendulum was studied extensively in the 18th century by Daniel Daniel Bernoulli, 
Bernoulli and Euler, with three published works between 1738 and 1741, and 1700-1782. 
by Johann Bernoulli in 1742: but only for the small amplitude oscillations Leonhard Euler, 


were the correct equations of motion obtained. Thus, half a century after ‘pelicans 
nahin ; ne: Johann Bernoulli, 
the publication of Newton’s laws, their application was not routine. 1667-1748. 


The double pendulum is shown schematically in Figure 4.4: it comprises a A discussion of the history 

simple pendulum suspended from another simple pendulum by a frictionless and evolution of dynamics 

hinge. Both pendulums move in the same plane. during this period is given 
in The evolution of 
dynamics: vibration theory 
from 1687 to 1742 by 
J. T. Cannon and S. 
Dostrovsky (Springer, 
1981). 


4.2 Newton’s laws 


02 
| 
P, 
m2g 
Figure 4.4 The double Figure 4.5 Diagram showing the 
pendulum. forces acting on each particle of 


the double pendulum. 


In this system the rods OP; and P,P. are rigid, light and inextensible. The 
masses ™m, and mz are fixed at P, and P> and the two hinges at O and P, 
are smooth. Further, we assume that neither 6; nor @2 is constrained so 
both may take all values. 


The configuration of this system is given uniquely by the values of the angles 
0, and 6, and the system is said to have two degrees of freedom. We 
expect Newton’s equations to involve only these angles and their derivatives. 
Further, the configurations corresponding to (6; + 2n,7, 62 + 2ng7), where 
n, and ng are integers, are physically identical, so the equations of motion 
must be 27-periodic in these angles. 


The derivation of the equations of motion that follows is complicated. Our 
aim is to show directly the difficulties that can arise when Newton’s equa- 
tions are applied to relatively simple mechanical problems, with the hope 
that you will appreciate the efforts made by Lagrange to provide a more so- 
phisticated, yet simpler method. You will not be assessed on this material, 
but please read it and note the opportunities for error. 


Equation of motion for P; 


As with the simple pendulum we first need to find the Cartesian coordinates 
of P,; and P» and, as in that case, we use a reference frame Oxy with 
the origin at O and the y-axis vertically upwards, as shown in Figure 4.4. 
The coordinates of P, are given by equation (4.9), with a suitable notation 
change: 


xj =l,sin@,; and y, = —l, cos}. (4.22) 


The coordinates of Pp with respect to P; are similar, (l2 sin 62, —l2 cos 42), 
so the coordinates with respect to O are 
v2 = 24, + losin dg = 1, sin #; + losin 9, 


(4.23) 
y2 = y1 — lg cos 82 = —l, cos 4; — lo cos O2. 


The equations of motion for P, are similar to those derived for the simple 
pendulum, equation (4.10), because the forces on this mass are similar, 
compare Figure 4.5 and 4.2. These equations are 

d* x9 d° 


m2 =—T5sin0. and mo = Ty cos 02 — mag. (4.24) 
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As before, we need to eliminate 7> from these equations. This is achieved 
by multiplying the first equation by cos 42, the second by sin 42 and adding 
the results to obtain 


mg (#2 cos 2 + to sin 82) = —maqg sin 2, (4.25) 


which is the equivalent of equation (4.11). The next step, however, is more 
difficult because both x2 and y2 depend upon 6; besides 62. We require the 
accelerations %2 and Y2 to be expressed in terms of the angles: for this we 
first differentiate equations (4.23) with respect to t, to obtain 


oe 1161 cos 4, + Io 05 cos6>, go'= 1161 sin 6; + 1505 sin 4. (4.26) 
A second differentiation gives 
X92 = ly (4; COS 01 = 9? sin 0: is ly (4, COS A> = 03 sin 0.) , 


. : Z (4.27) 
Y2 a Ly (4, sin 0; + 0? cos 0, + I» (4, sin Ao + 03 COs 0) : 


From these, we obtain 
L9 COS Ao + yo sin A> = ly (4; cos(4, — 65) _ 9? sin(O; = 00) = 2 1269, (4.28) 


where the identities 


cos(u — v) = cosucosv + sin usin v, 


sin(u — v) = sin ucosv — cosusinv Pee 
have been used. 
Hence equation (4.25) becomes 
Os cos(6; — 02) — 1,0? sin(@; — 62) + 150 = —gsin 69, (4.30) 


which is one of the required equations of motion. Notice that this equation 
is independent of the masses and that if the rod OP, is held stationary 
(0, = 0), this equation becomes equation (4.13), with suitable changes of 
variables, as should be the case. 


Equation of motion for P, 


The force on P; involves the tension 75, so we first need to use equa- 
tions (4.24) to express T> in terms of the angles. This is achieved by multiply- 
ing the first of equation (4.24) by sin 62, the second by cos 62 and subtracting 
to obtain 


mg (#2 sin 82 — #2 cos 82) = —T> + moqg cos bo. (4.31) 


Now use equations (4.27) again to express the left-hand side of this equation 
in the form 


L9 sin A> = Y2 COS A> = = sin(@; a 2) — 1,0? cos(6; ~~ 05) ae 1263,(4.32) 
hence 
I5 = ms (9 cos 9 + 116; sin(@; — 02) + 1,0? cos(6; — 02) + I263) «(Ae 


The force on P; involves both tensions, T; and 7, and the gravitational 
force, so its equation of motion is more complicated than that of Py. The 
x- and y-components of the force acting on P; are 


Fi, = —T, sin 6; + To sin 62, 


(4.34) 
Fiy = Ti cos 0; — Tz cos 02 — myg, 
so the equations of motion are 
m2, = —T, sin 6; + T> sin 4, (4.35) 


MY = qy COS Cal _ T> COS A> — mg. 


4.3 A discussion of Newton’s laws 


Since 75 is known, equation (4.33), we need only eliminate 7, from these 
equations to find the equation of motion. As usual this is achieved by 
multiplying the first equation by cos 6;, the second by sin @; and adding, to 
obtain, 


M1 Gia COS Cal + YI sin 0,) = —T>5 sin(@; aa 05) = 719g sin 1. (4.36) 


But, as in the analysis following equation (4.12) we have #1 cos 6; + yj sin@, = 
1,6,, so this equation of motion becomes 


my41,6} = —T>5 sin(), ae 63) — ™1g sin 01. (4.37) 


We can now use the expression (4.33) for T> to simplify this equation. Note, 
however, that the algebra is complicated and, if you wish, ignore it and go 
straight to the final result, equation (4.40). 


On substituting for T>) we obtain 
my11161 = —m1gsin 6, — mgsin 449 (5 cos Oy + 1161 sin O19 
+ 1,0? cos O49 + 1263) (4.38) 


where 612 = 6; — 62. In this expression the term sin? 6. occurs and we 
replace it by 1 — cos? 642, then rearrange the equation to 


(m1 + mg) 1,6, = —g (mi sin 0; + m2 sin 619 cos 0.) 
— mg ( 116; cos 649 + 1,0? sin 019| cos O19 


+ p62 sin 612) - (4.39) 


Now use equation (4.30) to replace the term in square brackets to obtain 


7 and : . 
1,6, + gsin@; + os @ cos(6; — 05) “| 05 sin(@; — 02) aR (4.40) 
This and equation (4.30) are the final equations of motion for the double 
pendulum. They are a pair of coupled, nonlinear, second-order differential 
equations for 6, and 92, the solutions of which cannot be expressed as a 
finite combination of known functions. Notice that the masses m, and m2 
occur only as the ratio m2/(m, +mg). A far simpler method of deriving 


these equations is given in Section 4.5.2. 


4.3 A discussion of Newton’s laws 


In the previous section Newton’s laws were quoted and then applied to some 
representative problems. Although sometimes a daunting task, particularly 
the last example, these applications were relatively straightforward, provided 
panic is avoided — but it helps that the collective wisdom accumulated over 
the past four centuries ensures that the correct answers are known. For 
novel systems of any complexity the derivation of the equations of motion 
is not easy and there are many opportunities for error. Lagrange’s method, 
described in the next section, makes this task easier. 


Newton’s laws were presented together in Principia I (1687), but the ideas 
had been evolving for some time before. Indeed, Newton stated that the 
first two laws were known to Galileo and they were stated in Horologium 
Oscillatorium, published by Huygens in 1673. The third law is new: Newton 
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applied this specifically to collisions, and it is related to momentum conser- john Wallis, 1616-1703. 
vation, previously described by Wallis, Wren and Huygens in submissions Christopher Wren, 
to the Royal Society in 1668. 1632-1723. 


Newton’s laws are not self-evident. Indeed, from the time of Aristotle to Aristotle, 384-322 Bc. 
the 17th century, when Galileo stated the first law (published in 1612), the 
contrary was believed. The Aristotelian description of motion is complicated 
and includes the view that the natural state of a body is one of rest, which 
follows from the notion that everything must be moved by something, a 
proposition that is in direct contradiction to the modern idea of relative 
motion; Galileo’s analysis, discussed briefly in Section 4.5.1, changed the 
description to something very close to the modern view. Nevertheless, even 
today, for the Earth-bound, it takes some effort of thought to overcome the 
Aristotelian view. This is why the first law was so significant, and why it is 
stated separately even if redundant in modern physics. 


After 1687 a great deal of effort was expended in applying Newton’s laws, 

in attempts to understand to which physical systems they were applicable, 

in trying to find solutions to various problems, particularly those related to 

the solar system and in trying to understand the nature of these solutions. 

The first problem was to derive Kepler’s laws from first principles. These 

three laws were published by Kepler between 1609 and 1619. The first two, Johannes Kepler, 
published in 1609, are: (i) Planets describe ellipses round the Sun, the Sun 1571-1630. 
being at one focus; (ii) the line joining the Sun to any planet sweeps out 

equal areas in equal times. The third law, published in 1619, states that 

the squares of the periods of the planets are proportional to the cubes of 

the major axes of their orbits. The first two laws were deduced from careful 
observations of the orbit of Mars. The theoretical origin of Kepler’s third 

law is discussed in Exercise 4.53. Amongst many other works, Kepler also 

suggested that tides were caused by the attraction of the Moon. 


The next major triumph was the accurate prediction of the return date of 

what we now call Halley’s comet. Comets have been observed since ancient The word comet is from 
times as isolated events. In the West, the oldest record is a Babylonian in- the Greek for ‘long-haired 
scription dating from 1140 Bc, but in China records date from 1600 BC, with 8¢» a reference to the 
an observation of Halley’s comet in 467 BC. However, the notion of comets ee 

returning had not been considered seriously much before the time of New- 

ton. ‘Tycho Brahe carefully observed the 1577 comet and concluded that it Tycho Brahe, 1546-1601. 
was in an approximately circular orbit round the Sun; hence giving credence 

to the view that comets might be part of the solar system. During the 17th 

century various authors suggested that comets move on parabolic orbits, 

and Newton used observations of the great comet of 1680 (now thought to 

have a period of about 8800 years) to compute its orbit which he estimated 

to be a parabola with the Sun as its focus. Halley then attempted to com- Edmond Halley, 

pute the orbits of 24 comets for which sufficiently accurate data was known 1656-1742. 

and found that among these 24 examples three, those of 1531, 1607 and 

1682, seemed almost identical and that the observations were separated by 

the almost equal intervals of 76 and 75 years. Halley concluded that these 

orbits were actually ellipses corresponding to the same comet, which would 

return in 1758. 


The precise calculation of the return date was not easy because the orbit 

is perturbed by Jupiter and Saturn. In 1748 three French mathematicians 

Clairaut, Lalande and Lepaute sequestered themselves to calculate the re- Alexis Chaude Clairaut, 
turn date of the comet, last seen in 1682. Lalande wrote: “During the six 1713-1765. 

months we calculated from morning to night, sometimes even at meals; the Joseph-Jéréme le Francais 
consequence of which was, that I contracted an illness which changed my Lalande, 1732-1807. 
constitution for the rest of my life.” The difficulty in accounting for the ef- Nichole-Reine Lepaute, 
fects of Jupiter and Saturn was exacerbated by the fact that probably the 1723-1788. 
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only calculating aids they had were logarithms. These approximate compu- 
tations of the orbit predicted the date of return to its perihelion to be April 
1759; its actual return in March, just within the errors quoted by Clairaut, 
provided confirmation that Newton’s laws were more widely applicable than 
originally thought. 


The trajectory of Halley’s comet has now been computed backwards and all 
30 previous passages described in historical documents, over 22 centuries, 
have been authenticated, including the 1066 passage shown in the Bayeux 
tapestry. 


It took longer, however, to understand the behaviour of the solutions of 
Newton’s equations — over two hundred years after their first publication, 
in 1898 Poincaré showed that these could behave in the most alarmingly 
complicated manner, which we now name chaos. 


The simplicity of Newton’s laws is deceptive: they combine definitions with 
observations from nature, and use partly intuitive concepts together with 
some unexamined assumptions about the properties of space and time. 
These ideas are conceptually difficult and it was not until two hundred years 
after Newton that the Austrian physicist Mach in 1883 analysed some of 
these problems logically: this type of analysis played a significant role in the 
development of Einstein’s special theory of relativity. 


The first law implies the existence of isolated bodies and inertial coordinate 
systems: this needs some explanation. Any experiment is performed within 
an assumed reference frame, usually defined by the laboratory in which it 
is performed. In principle, experiments done in a particular laboratory will 
allow one to deduce, or at least confirm, physical laws and in particular the 
laws of mechanics. However, the same experiment performed in different 
laboratories may produce different results. For instance, a laboratory in a 
uniformly accelerating space rocket, or fixed to the end of a large rotating 
beam, would not confirm Law I, without taking into account the acceleration 
of the laboratory, of which the scientist may not be aware. A simple example 
of this effect is the motion of a particle on an accelerating wire considered 
on page 148. 


An inertial reference frame is one in which Law I holds; essentially, Law I 
asserts the existence of an inertial frame. Further, it is assumed that the 
laws of motion are invariant under transformations between inertial reference 
frames. The limiting case considered in Exercise 4.3 is an example, because 
the uniform motion of the pendulum support does not affect the pendulum 
motion. ‘Thus, two laboratories moving uniformly relative to each other 
would deduce the same dynamical properties for the pendulum. This is 
Galileo’s relativity principle which states that space and time are absolute; 
that length, time and mass are independent of the relative (uniform) motion 
of the observer. A consequence of this principle is that the speed of light de- 
pends upon the relative motion of the observer, an inference contradicted by 
the experiments of Michelson and Morley in 1887. Despite, this, because the 
speed of light is so large, about 3 x 10/°cems~', or 186,000 miles~!, Galileo’s 
relativity principle is approximately true in many important situations. In 
an inertial reference frame the second law implies the first. 


Newton’s second law relates the ‘change in motion’ to an impressed force. 
Here the term ‘motion’ means the momentum, a vector quantity we often 
denote by p. Newton used the term ‘change’ rather than the more modern 
‘rate of change’ because his experimental examples involved impulses; one 
such example is described after Exercise 4.30 (page 185). For a body of 
constant mass, its momentum is usually the product of its mass, m, and 
its velocity, v; the main exception being the motion of charged particles in 
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electromagnetic fields, which we shall not consider. The modern version of 
Newton’s second law is, in symbolic form, 
dp 


d 
a For, for constant mass, en “aa (4.41) 


where F is the force acting on the body: the first of these equations applies 
when the mass varies, as in rocket motion. These equations are known as 
Newton’s equations of motion. 


The notion of a force is difficult. First we observe that it is a vector. Second, 
implicit in this law is the fact that forces add vectorially, a fact that Newton 
added as a corollary; this result is an empirical law, crucial for our under- 
standing of many particle systems. The first observations leading to this 
law were those of Galileo in his description of projectile motion, in which he 
separated horizontal and vertical components of motion. 


Forces always arise from interactions between particles, and if we ever found 
an acceleration without an interaction, we would be in a dreadful mess. It 
is the interaction which is physically significant and which is responsible 
for the force. For this reason, when we isolate a body sufficiently from 
its surroundings, we expect it to move uniformly in an inertial frame. An 
isolated body is one from which interactions have been eliminated — logically, 
this means that isolated bodies cannot be seen, because observing an object 
means interacting with it using, for instance, light; however, light exerts a 
very small force so we can see only approximately isolated bodies: a deep 
analysis of this problem leads to Heisenberg’s uncertainty principle and to 
quantum mechanics, which is beyond the scope of this course. 


4.3.1 Internal and external forces 


In this and the next section we introduce the idea of generalised coordinates, 
starting with a discussion of N interacting particles. If you wish to have in 
mind a realistic example, for N = 3 think of the Sun—Earth—Jupiter system 
as a set of 3 interacting particles: another is the Sun—Earth—Moon system. 


Any pair of isolated particles will interact through a force acting along the 
line joining their centres because, by definition, this is the only preferred 
direction The force on particle 7 due to particle 7 is denoted by F;;, and 
similarly the force on j due to 2 by Fj;. Newton’s third law is satisfied only 
if F;; = —F;;. Thus for the two particle Sun—Earth system the attractive 
forces are as shown in Figure 4.6. 


Sun 


Mein 
@ Earth 


Figure 4.6 Schematic diagram of the interaction between the Earth, of mass mp, 
and the Sun, with mass msg, often accurately described by assuming both to be 
point particles. 


Note that in this case the magnitudes of the force on each particle are 
identical, but the effects on each are quite different, simply because the Sun 
is far more massive than the earth, mg = 328 900mg; a consequence of this 
is explored in Exercise 4.5. 


The interaction between any pair of particles is unaffected by any third 
particle. The pairwise forces F;;, which usually depends upon the positions 
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of particle 2 and 7, are normally named internal, for reasons that will become 
clear later; see equation (4.45). 


In addition to these pairwise, internal forces each particle of a system may 
be affected by external forces, due to another system apart from that being 
considered. Usually external forces are gravitational or electromagnetic in 


origin. We denote the external force on particle i by FO), An example show- 
ing the difference between internal and external forces is discussed next and 
from this we shall see that this distinction depends upon the approximation 
used to describe the system. 


Consider the Sun-Earth—Jupiter system, with each body being treated as 
a single particle, with the aim of understanding the motion of the Earth. 
It is possible, and a good approximation, to consider this system of three 
particles in isolation and then the only forces are the internal forces between 
the three particles: there are no external forces. This, however, gives rise to 
equations of motion that are very hard to solve. 


The force between the Earth and Jupiter is, however, relatively weak by 
comparison to that between each planet and the Sun, so the simplest first 
approximation is obtained by ignoring this interaction, and then the Earth 
and Jupiter orbit the Sun unaffected by each other. This approximation 
conforms to Kepler’s description of the solar system giving slightly elliptic 
orbits at mean distances of 1 AU and 5.2 AU, respectively, from the Sun, 
and with periods of one (tropical) year and 11.86 years, respectively. 


The weaker interaction between the Earth and Jupiter has an asymmetric 
effect because Jupiter is so massive, my ~ 320m, which means that a better 
but more complicated approximation, may be obtained by assuming that the 
motion of Jupiter is unaffected by the Earth, but that the Earth is affected 
by Jupiter. This asymmetry is represented mathematically by treating the 
effect of Jupiter as an external force on the Earth, as shown schematically, 
in Figure 4.7. In this approximation to the motion of the Earth the only 
internal force is that between the Earth and the Sun, and the resulting 
equations of motion are easier to understand. 


Figure 4.7. Diagram showing how the effect of Jupiter on the motion of the Earth 


may be represented by a time-dependent external force, Fe mM with the force on 
Jupiter due to the Earth being ignored. 


Note that at different times of the year this external force is in different 
directions and has different magnitudes, so depends upon the time. 


For N particles each could be affected by the N — 1 interactions with the 
other particles. The motion of two interacting particles is well understood 
and solved in any text book on dynamics. The motion of three or more inter- 
acting particles is, however, exceedingly difficult to understand. Indeed, the 
three-body problem remains one of the most celebrated problems in celestial 
mechanics; it can be simply stated as follows. Three particles move in space 
under their mutual gravitational attraction; given their initial positions and 
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velocities determine their subsequent motion. The importance of this prob- 
lem can be judged from the fact that between 1750 and 1900 more than 800 
papers on this topic were published and new work is still being produced. 


In general, for N interacting particles, where the 7th has known mass, mj, 
and position vector r;, with Cartesian coordinates, r; = (2, yj, 2), with 
respect to a given reference frame, there are 3N degrees of freedom. The 
three Newtonian equations of motion for each of the N particles are, in 
vector form 

d*r; (e) . 

a) 5 

ees oe a eee 2 (4.42) 

= 1 

i#i 


Note that in this equation the 7 =i term of the sum is omitted because 
the quantity F;; is not defined: in some texts, for algebraic convenience, it 
is defined to be zero. More important you should note that normally all 
forces depend upon the position vectors. These 3N equations are the math- 
ematical statement of Newton’s second law. In these differential equations 
we normally know all the masses and all the forces, which usually depend 
upon the position vectors in a known manner: the objective is either to solve 
these equations for r;(t) given some initial conditions, that is r;(to), ri(to), 
k = 1,2,...,N, or to obtain some qualitative understanding of the solution: 
for instance, in studies of the solar system we should like to know whether 
the distance between the Earth and the Sun remains within given limits for 
all times. In recent years this type of problem has become of public concern 
with various groups worrying about the distance between the Earth and 
individual asteroids of the larger variety. 


In principle, equations (4.42) may be solved, possibly numerically, for each 
of the 3N variables provided all the forces together with all the initial con- 
ditions are known. In practice, except for the simplest of systems, these 
equations can be solved only by numerical integrations and then, with a 
given accuracy, only for a finite time, which depends upon the accuracy re- 
quired: furthermore, it is usually difficult to understand the properties of a 
system from numerical solutions. 


4.3.2 The centre of mass 


Before proceeding it is worth noting one of the consequences of Newton’s 
third | law, which manifests itself in the symmetry relation F;; = —F;;. ‘This 
involves the centre of mass of a system of N particles which is defined by 
the vector R, 


N N 
MR = ) ined = ete (4.43) 
pal eonad 


so that M is the total mass. We now show that the centre of mass of a 
system of N particles moves like a single particle, of mass M equal to the 
total mass of the system, acted upon by the sum of all the external forces, 
and that this motion is independent of the internal forces. 


This is shown by differentiating the vector R twice and then using equa- 
tion (4.42) to obtain 


. N N NO 
MR = a et: = - Fo <7 ae 7: F;;. (4.44) 
t=1 
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But in the second sum, the terms F,., and F., occur, for each r and s, only 
once and occur as F,, + F, = 0; thus the double sum is zero. 


Thus the equation of motion for the centre of mass is 


ee N 
wey (4.45) 
4 | 


which is Newton’s equation for a particle of mass M at R, acted on by 
the sum of all external forces. This means that the centre of mass motion is 
unaffected by the forces between the constituent particles, which is why they 
are named internal forces. If the sum of the external forces is zero, then 


R = 0 and the centre of mass is either stationary or moves with constant 
velocity: in either case, it is then usually convenient to put the coordinate 
origin at the centre of mass. 


Equation (4.45) also means that a rigid body can often be treated as a single 
particle with all external forces acting through the centre of mass. 


We have already seen an example of equation (4.45). For the simple pendu- 
lum hanging from a freely moving mass, the equations of motion being given 
by equations (4.15) and (4.16) (page 151), it was noted that MX + mz = 0, 


but this is just the z-component of R because there are no external forces 
in this direction. 


Exercise 4.5 


(a) The centre of mass, C’, of two particles of mass m, and m2, with position 
vectors r; and ro, is given by equation (4.43). Show that the vectors from r; 
to C and rg to C are, respectively, 


= R—- | fo = R-—- r9 
Mao and M4 
SS Se FG = <reee Fig 
my, + M2 my, + M2 


where rio = re — Yr; is the vector from m, to mo. 


(b) The masses of the Earth, mp, and the Sun, mg, are related by mg = 328 900mg, 
and the mean distance between their centres is 149.6 x 10° km. Use the result 
derived in the first part of this exercise to show that the centre of mass of the 
the Earth—Sun system is a distance of 455 km from the Sun’s centre. 


(c) The two most massive bodies in the solar system are the Sun and Jupiter, 
with masses related by mg = 1047.4myj, and the mean distance between their 
centres is 778.3 x 10° km. Show that centre of mass of the the Jupiter-Sun 
system is a distance of 7.43 x 10° km from the Sun’s centre. Note that the 
radius of the Sun is 6.91 x 10° km. 
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4.3.3 Generalised coordinates 


In any description of a dynamical system it is necessary to specify the po- 
sitions and velocities of the constituent parts, and our objective is to deter- 
mine how these change with time. The positions are always defined with 
reference to a coordinate system, usually fixed or moving with a constant 
velocity. The simplest coordinate system is the Cartesian system with the 
equations of motion being expressed in terms of the Cartesian coordinates of 
each particle. Although the simplest system, it rarely results in the simplest 
equations of motion, so it is almost always necessary to use other coordinate 
systems. ‘The choice of coordinate system is so important to our understand- 
ing of any equations of motion that it is important to be able to formulate 
these equations in terms of appropriate coordinates. 


Generalised coordinates and the associated theory provide this flexibility 
by enabling the equations of motion to be written in terms of the smallest 
number of equations. You have already seen such coordinates in action when 
the equations for the two Cartesian coordinates of the simple pendulum, 
equations (4.10), became one equation for 0, equation (4.13): in this example 
@ is the generalised coordinate which automatically allows for the length of 
the pendulum to be constant. 


Another example is the system comprising two particles connected by a light, 
rigid inextensible rod, of length J; the two position vectors r; and ro are 
not independent because they are constrained by the relation |r, — ro| = I: 
a hydrogen molecule in which the internuclear distance is assumed fixed 
is an example, which is a good approximation at low temperatures. A 
coordinate system describing these two particles needs to take into account 
the constraint. 


Another example is a single particle constrained to move along a curved 
wire, or on a surface: in these cases the position vector is allowed to move 
in only one or two dimensions respectively and the coordinate system used 
needs to reflect this. 


Constraints are important because they limit the possible configurations of 
the system and this simplifies the equations of motion. They are introduced 
because we need to approximate the system to obtain manageable equations 
of motion involving only the essential variables. For instance, the solid block 
(which may be treated as a single particle), of mass M, sliding smoothly on 
a fixed, horizontal surface, shown on the left in Figure 4.8, is constrained to 
move in two rather than three dimensions. 


R 


Mg Mg 


Figure 4.8 Diagram showing the forces on a solid block on a horizontal surface 
(left) and a tilted surface (right). 


Gravity exerts a downward force, Mg, on the block, but on the left the 
surface does not allow the block to move in this direction, so Newton’s first 
law shows that there is no net vertical force on the block, and the third 
law implies that there must be an upward force of reaction from the surface 
to exactly compensate the downward force of gravity. This is denoted by 
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R in the figure, and is perpendicular to the surface. This reaction can be 
rationalised by considering the forces between the constituent molecules of 
the block and the surface. But for most purposes this is not helpful and 
Newton’s third law cuts through this complex atomic description to a far 
simpler and usable description, giving R = Mg. For a smooth surface the 
force of reaction is perpendicular to the surface. 


Now tip the surface, 4B, as shown on the right in the figure. The reaction 
R remains perpendicular to the surface, because the molecular interactions 
are unaffected by gravity, and is no longer parallel to the gravitational force, 
so there is a net downward force. Provided R remains perpendicular to the 
surface, the second law shows that the block slides down the plane. This is 
what happens if there is no friction and then the value of R does not affect 
the motion. In this example the reaction R is known as a force of constraint, 
and is similar to that introduced to determine the equations of motion of 
a bead on a wire, in Section 4.2.1. The reaction R is a consequence of the 
geometric constraint restricting the block to lie on the table. Generally, ge- 
ometric constraints are associated with forces that impose these constraints: 
these are known as forces of constraint. 


Another simple example is the pulley system shown in Figure 4.9. 


mg 


Mg 


Figure 4.9 A simple pulley system comprising a light inextensible rope fixed to the 
ceiling at A, a pulley B, also fixed to the ceiling, and a second, pulley C’, of mass 

M, and a particle D of mass m, both free to move in the vertical plane. The rope 
passes round both pulleys and is attached to the particle D. 


In this example we assume that the pulleys rotate with no friction, have no 
rotational energy and that the rope does not stretch. ‘These assumptions 
mean that the tension J in the rope, not shown in the diagram, has the 
same value at all points. The constraint is that the length of the rope is 
constant, which means that the heights of the particle D and the pulley 
C are not independent (provided the rope does not break). This system is 
analysed using Lagrangian methods in Exercise 4.28. 


There are different types of constraints. In this course we consider only 
those that can be expressed in terms of equations like 


f(ri,ro,...,rn,t) =0 (4.46) 


where f is a function of the position vectors and, possibly, the time ¢. For 
instance, the block on the left-hand side of Figure 4.8 has the constraint 
z = constant, where the z-axis is vertically upwards and there is a similar 
relation when the plane is tipped. For the pulley system shown in Figure 4.9, 
the two distances between the pulley C and the particle D and the ceiling, 
Y and y respectively, are related by 2Y + y = constant, because the rope is 
inextensible. 
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Constraints of the type defined in equation (4.46) are named holonomic 
constraints and a system with such constraints is named a holonomic system. 
Other types of constraints exist and are named nonholonomic constraints, 
but are not considered in this course. 


Before describing the properties of holonomic constraints, we list a few ex- 
amples. 


A bead 

Constrained to slide on stiff wire with a fixed shape, as in Section 4.2.1, a 
bead has one degree of freedom because its position is uniquely described 
by a single variable. 


A moving wire 

A variant of the previous example is obtained by moving a wire in a pre- 
scribed manner. A single variable defines the position of the particle so the 
system still has one degree of freedom. 


Alternatively, we could allow the shape of the curve to change in a prescribed 
manner: the system still has one degree of freedom; see Exercise 4.48 for an 
example. 


The pendulum 
The simple pendulum has one degree of freedom and the double pendulum 
has two. 


Motion on a surface 
A point on a two-dimensional surface, stationary or moving in a prescribed 
manner, requires two variables, so a particle sliding on such a surface has 
two degrees of freedom. 


A block sliding on a plane surface 

A block sliding on a plane surface has three degrees of freedom, because the 
centre of mass of the block requires two coordinates to define its position. 
A third coordinate is needed to define the orientation of axes fixed in the 
block relative to axes fixed on the surface. 


Child’s spinning top 

With the point of contact, O, fixed, a child’s spinning top has three degrees of 
freedom because an axis OP, through the point of contact at O, fixed along 
the symmetry axis of the top, requires two angles to define its orientation, 
the vector n, and the angle of rotation about this axis, ¢; see Figure 4.10. 


Figure 4.10 Diagram showing the coordinates of a child’s top. 


If the point of contact is allowed to move around the plane, then two more 
degrees of freedom are added. 


A solid body 

A solid body moving in space has six degrees of freedom: there are three 
coordinates for the position of its centre of mass; and three further angles 
defining its orientation. 
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In general if there are / independent holonomic constraints, defined by the 
M equations 


CAtiste, «2,85, 0) = 9, ce a eee. 2 (4.47) 


then it can be shown that the physically possible positions of the system, 
that is the values of all 3N coordinates (x;,y;,2;), i = 1,2,...,N, can be 
described by n = 3N — M coordinates q,, k = 1,2,...,n, so we must have 
M <3N —1. The number n is important and is called the number of degrees 
of freedom of the system and the n coordinates gq, are called the generalised 
coordinates. For any given system the number of degrees of freedom is 
unique, but there are infinitely many choices of the generalised coordinates. 


Generalised coordinates arising from holonomic constraints satisfy two con- 
ditions: 
(i) their values completely determine the configuration of the system: 


(ii) they may be varied arbitrarily and independently without violating the 
constraints of the system. 


The n generalised coordinates gy, k = 1,2,...,n, are frequently used to 
define coordinates in an abstract space which is named the configuration 
space. For instance a single particle moving in three dimensions has for 
its configuration space the usual three-dimensional Euclidean space. If the 
particle was confined to the Oxy-plane the configuration space would be 
this plane: but it might be confined to the surface of a sphere or a torus, in 
which case the configuration space would be different. 


There are two principal methods of dealing with holonomic constraints. One 
is to introduce the forces of constraint, for example, the reaction on the wire 
in Section 4.2.1 and the tension in the pendulum in Sections 4.2.2 and 4.2.3. 
The second method is to formulate the equations of motion in terms of 
generalised coordinates, so the constraints are built in at the beginning. In 
general, when applicable, the second method is easiest to apply: the general 
formalism is described in Section 4.4 and it is applied to some problems in 
Section 4.5. 
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Exercise 4.6 


For each of the following systems, state the number of degrees of freedom and 
suggest suitable generalised coordinate(s). 


a) A particle moving on a fixed, stationary circle. 
b) A particle moving perpendicular to a given stationary plane. 


( 

( 

(c) A particle moving on the surface of a sphere. 

(d) A rigid body rotating about an axis fixed in space. 
( 


e) A particle moving on a circle which is rotating about a diameter in a prescribed 
manner. 


(f) A particle moving on a circle which is free to rotate about a horizontal diameter. 


(g) A rigid body (i) moving freely in space, and (ii) with a stationary centre of 
mass. 


(h) A planet moving round a star, fixed in space. 


Exercise 4.7 


(a) Consider the pulley system illustrated in Figure 4.9. If the distance of the 
pulley C and the particle D below the ceiling are Y and y respectively, show 
that for an inextensible rope these variables are connected by the constraint 
ar + Y= cometant. 


(b) How many degrees of freedom does this system have? 


(c) Explain why the tension is the same along the length of the rope. 


4.3.4 Kinetic and potential energy 


In this section we introduce two concepts that are essential for the develop- 
ment of Lagrange’s equation of motion: the kinetic energy and the potential 
energy for a system of interacting particles. 


Kinetic energy 


The kinetic energy of a single particle of mass m and with velocity v is a 
scalar and is defined to be smv?, where v = |v|. For a set of N interacting 
particles the total kinetic energy, T, is defined to be the sum of the individual 
energies, 


1 = 


Nol 


N 
> m7. (4.48) 
t=] 


Note that the symbol T is also used to denote a tension: this should not 
cause confusion because these quantities normally appear in quite different 
contexts and have different units. Each component of this sum is propor- 
tional to the square of the speed of each particle, and depends on the first 
time-derivative of the associated position vector, 

dr; D) 


> (Gi;05, 2), BO wag + yr +e. (4.49) 


Each vector r; will depend upon the n generalised coordinates, gz, k = 
1,2,...,n, so each of the 3N components (4;, %;, 2;) will depend on these 


Vi= 
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generalised components and their derivatives, gx, k = 1,2,...,n. For exam- 
ple, if 
ti; = filgi, go,---,Qnst), Bate eee (4.50) 


then using the chain rule to differentiate with respect to t gives 


= => Ofi oe 
« Ok a? ee 


ae © ears, (4.51) 


with similar relations for the y- and z-components. ‘The derivatives q,z, 
k= 1,2,...,n, are called the generalised velocities. We see that the time 
derivatives (%;,%;,2;) are all linear functions of gz, but not necessarily of 
the generalised coordinates. Because all the first time derivatives of r; are 
linearly dependent on the generalised velocities, the kinetic energy depends 
quadratically on the generalised velocities. 


Two examples should make the preceding analysis clear. First, consider 
a single particle moving in a plane with Cartesian coordinates (x,y): its 
velocity is v = (“,y). In plane polar coordinates, (r, 6), we have 


f=, y= reine. (4.52) 


We may use either (x, y) or (r,@) as generalised coordinates and the respec- 
tive generalised velocities are x, y and r, 9. Differentiating the above two 
equations we obtain 

¢=?rcosd—résind, y=rsind+récosé. (4.53) 


These are the equivalent of equations (4.51), and we see that « and y depend 
linearly upon 7 and @, because of the chain rule. The speed of the particle 
is given by 


: 2 . z 
v? =i? +9 = (Fcos6 — rdsin#) + (isin 6 + r6.cos8) 
= _ 4. r6?, (4.54) 


and this depends quadratically on the generalised velocities 7 and 0, as 
expected. 


As a second example, we consider a single particle moving in three dimen- 
sions. The Cartesian coordinates are (x, y, z) and these are appropriate gen- 
eralised coordinates, with the associated generalised velocities x, y and 2. 


Another set of coordinates are the spherical polar coordinates, (r, 0, ), use- 
ful when there is a spherical symmetry. These can be defined by the equa- 
tions 


2=Pfemnlcso, ygo=rsinGsngd, «= recast. (4.55) 


The new generalised coordinates are r, 6 and ¢ and the physical meaning of 
these is shown in Figure 4.11. 


Figure 4.11 Diagram showing the physical meaning of the spherical polar 
coordinates (rT, 6, ¢@). 
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Differentiation of these equations gives 

x=rsinédcos¢@+ r cos 6 cos b — rosin @ sin ¢, 

y =7sindsing+récosésing + rdsin 6 cos ¢, (4.56) 

2=?fcos6 —résiné. 
The speed is #* + y? + 27; it is easiest to first find £2 + y?: 

e+y? =r (sin? 9 cos? ¢ + sin? 6 sin? o) 

+ 26? (cos” 9 cos’? @ + cos? 6 sin? o) 
+ r7¢? (sin* § sin? ¢ + sin? 6 cos? d) 
+ 2r7r-6 (sin § cos 6 cos” ¢ + sin @ cos 6 sin? >) 
= 7 sin? 6 + 126? cos? 6 + r7¢? sin? 6 + 2r76 sin 8 cos 0. (4.57) 

Hence on adding 27 to this we obtain 

ety + 2 = 4776? + 17d? sin? 6. (4.58) 
This is a sum of the squares of the generalised velocities, with coefficients 
depending upon the generalised coordinates. 


Exercise 4.8 

The generalised coordinates (x, y) and (u,v) are related by x =u+vandy=u-—v. 
(a) Express the generalised velocities (u,%’) in terms of («, 7). 

(b) Show that ¢? + 9? = 2 (uw? + 07). 


(c) If the relations between (x,y) and (u,v) are replaced by = u+vt and y = 
u — vt find the generalised velocities and show that #7 + y? = 2 (w? + vt?) + 
Avit + 2v?. 


Exercise 4.9 


The Cartesian coordinates (x,y) are related to new coordinates - v) by x 
2 
v 


Exercise 4.10 


A particle of mass m moves on a curve described by the equation y = f(x). If s is 
the distance along this curve from some fixed point, show that the kinetic energy 
of the particle is 


T = ma? (14 f'(z) == game. 
|Hint: use the relation ds* ~ dx? + dy”, see Figure 1.5 (page 9).] 
Exercise 4.11 


A particle of mass m moves on an Archimedes spiral which has the equation r = aé, 
for some positive constant a and where (r,@) are the plane polar coordinates. Show 
that the kinetic energy can be expressed in terms of (r,7*) or (0,0) by the equations 


pa 
T= mr (: aa _ a: T= ima?" (1 + 6°) 


4.3 A discussion of Newton’s laws 


Exercise 4.12 


Show that the kinetic energy of the pulley system illustrated in Figure 4.9 is given 
by T = (M + 4m)y*/8, where y is the variable defined in Exercise 4.7. 


Potential energy 


Next we assume that all forces are conservative,, which means that the force 
on a particle is given by the gradient of a potential function, V(r,t), with 
respect to the position of the particle. For a single particle, with position 
r= (x,y,z), this means that the coordinates of the force on it are given by 


p=-( av av av 


Bs" By Be = (4.59) 


where V = V(r,t) is a scalar function of r and possibly the time t. The 
notation OV/Or for gradV may not be familiar, but it is very useful when 
there are several particles with positions (r,,r2,...,r%,) and V depends on 
two or more of these. In such cases we often need to differentiate with 
respect to a particular r;, and this can be denoted by OV/Or;, whereas the 
alternative notation is not so convenient. 


The significance of equation (4.59) is that all three components of the force 
are defined by a single scalar function, V, which is variously named the 
potential, the potential energy or the scalar potential. Because the force is 
given by the gradient of V, potentials differing by either a constant or a 
function of only the time, give the same force, and are hence equivalent. 


Conservative forces are so named because the work done when moving round 
a closed curve in configuration space is zero. Friction is not a conservative 
force because it always opposes the motion and the work done is always 
positive; gravity, where F = —mgz and V = mgz, on the other hand, is 
conservative because the work done going up is recovered on the way down. 


Consider a particle in the three-dimensional configuration space, shown in 
Figure 4.12, with a force field F(r) and a curve C. The work done against the 
force when moving a distance or along this curve is defined to be DW = F - or, 
that is force x distance travelled. Notice that the curve C is not necessarily 
a possible motion; it can be any smooth curve that does not violate any 
constraint. 


6 


Figure 4.12 An arbitrary curve joining rg to rp. 


Hence the work done when moving along the curve C’, from rg to rp, is the 
line integral 


w= | dr - F, (4.60) 
C 


which will normally depend upon the path taken. If, however, F is con- 
servative, with F = —gradV, for some potential V, then, for small changes 
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in r, 
OV OV OV 
oV = —d ——)) —(dz=-F.- 4.61 
ae a y+ = 02 or, (4.61) 
so the expression for the work becomes 
a | = vee, (4.62) 
C 


which is independent of the path. For a closed curve, rg = rp, hence W = 0. 
If V depends upon the time, then this is held fixed in the integration round 
C’, so it is possible to have conservative, time-dependent forces. 


The interaction between two particles at r; and r;, giving rise to the conser- 
vative force F;;, is described by a potential function Vj;(r,t) that depends on 
these position vectors only through the distance r = |r; — r;|, and possibly 
(but not normally) the time, but no other position vector. Then 


OV;; OVE 

F;; = -——"" and 2" 
Or; - Or; 

Note that for V;; the suffix order is immaterial. These interactions automat- 

ically satisfy the third law F;; = —F,;. The proof of this is not central to 

this course and is therefore given in Exercise 4.42 at the end of the chapter. 


(e) 


The external forces, F;’(r;,t), are also assumed to be conservative, but 
these may depend upon the position vector r;, not just its length. A simple 
example of an external force is an electron in a uniform electric field. If 
—e is the electron charge (e > 0) and E(t) the electric field, then the force 
on the electron is F = —eE(t); this is independent of the position and the 
potential energy is just V = eK(t) -r. 


(4.63) 


Conservative forces are important: gravitational forces are conservative and Gravitational forces are 
electric forces are also conservative, as is the force due to Hooke’s law, de- conservative only for 
scribed in Block I, Chapter 1; see also Exercise 4.14. Thus atoms, molecules, Newtonian gravitation; 

; this is not true in general 
the solar system and galaxies are all held together by conservative forces. sedi: 
Important exceptions are frictional forces and air resistance. 


Not all forces can be defined in terms of potentials. However, for one- 
dimensional systems, if the force F'(x,t) depends only upon x and t because 
F is a scalar it is always possible to find a potential V(2,t), simply by 
integrating —F'(x,t) with respect to x, with t fixed: note that if the force 
depends upon x such a potential cannot be found. 


Exercise 4.13 


(a) The radial distance r from the origin is given by r? = x7 + y? + z?. Show that 


rf oF eae se 


dx or’) «(Oy Or’ OOz or 
(b) Show that a potential energy function, V(r), depending only upon the radial 
distance r, produces the force 


Fe atthe = —V'(r/r. 


Potential energies add, so the potential energy of a set of N interacting 
particles is just the sum of the 5N (N — 1) interactions between each pair, 


N-1 N 
Y= .y > Vig (ri — ryl, €), (4.64) 


i=1 j=i4+1 
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or in full 
V= Vio + Vig + Via «+> + Wn-1 + Vin 
ae ee. i oe | + Von 
eee he + Vay 
(4.65) 
+  Vy-on-1- + Vaecee 
+ Vaan. 
It follows that the interaction force on the ith particle is 
N 
OV 
F; = -— = F;;. . 
aa d j (4.66) 
a 
jFt 


If there are many sources of potential energy the total potential energy is 
always taken as the sum of the values that each would contribute on its own. 


Exercise 4.14 


Consider a spring of natural length Jp and stiffness k lying along the z-axis. As- 
suming Hooke’s law (see Block I, Chapter 1) the force acting to shorten the spring 
when it is extended to the length Jp + x, is F = —kx show that this can be derived 
from the potential function V = ka? /2. 


Gravity 


One ubiquitous force is that due to gravitational attraction. Two point 
particles of masses m, and mz attract each other with a force inversely 
proportional to the square of the distance, r, between them and proportional 
to the product of their masses, giving rise to the gravitational potential 
energy 


Gmym 
Vee <-—— Pee (4.67) 
? 
where G is the universal gravitational constant. For a particle of mass m 
at a height z above the Earth’s surface Newton showed, by assuming the 
Earth to be spherical and integrating over the Earth’s volume, that the 


gravitational potential is 

GMm 
35 
where M and R are the Earth’s mass and radius respectively. 


Vizi= 


S 
IV 


0, (4.68) 


Exercise 4.15 


Show that if R > z then the gravitational potential discussed above reduces to the 
approximation V(z) = mgz, where g = GM/R? is the acceleration due to gravity. 
Taking M = 5.972 x 10° g, R = 6.378 x 10° cm and G = 6.673 x 1078 g~! cem3 s~? 
calculate the value of g. 


? 
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Exercise 4.16 


The formula for g found in Exercise 4.15 shows that the value of g decreases as the 
height above the mean sea level increases. 


(a) Show that the relative change in g between its value at a height h above the 
mean sea level, g(h) and its value at sea level, g = g(0), is given approximately 
by 


Se 
g g R 
For a hill 800m high show that Ag ~ 2.5 x 107g. 
(b) The approximate period of a pendulum clock of length / is T = 27,/I/g. Show 
that the relative change in T' and g are related by 


Deduce that a pendulum clock accurate to 1 s/day could be used to measure 
the relative change in g between sea level and the top of an 800m hill. 


The significance of this calculation is that during his trip to St Helena (1677— 
1678), Halley was the first to measure the slowing of a pendulum clock due to 
altitude. Halley deduced that the force of gravity decreased as the distance 
from the centre of the Earth increased. 


On the same trip he noted that at sea level the same clock ran slower in England 
than in St Helena. This is due to the flattening of the Earth, though Halley 
did not know this; see the solution on page 202 for more details. 


Energy conservation 


We end this section by showing that if the forces are conservative and inde- 
pendent of time, then the total energy, E, of the system is a constant. The 
total energy of NV interacting particles is defined to be the sum of the kinetic 
and the potential energies: 


N N 
E=15S me? +V+> Vv", (4.69) 
gl a | 


where V is the potential energy due to the internal forces and defined by 
equation (4.64). The time derivative of V is, on using the chain rule, 


N 
aE >) ars (4.70) 
and since 
N 
OV 
ae > ie (4.71) 
1 
j#i 
we have 
N (e) 
dE O O f 
TD (maw D Vit ae «| 
N N 
d*r; e 
=e mie — DF — FY =i, (4.72) 
wl j=l 
j#t 


where the last equality follows from Newton’s equations of motion, equa- 
tion (4.42). Hence the total energy, EF, is a constant on each solution, with a 


4.4 Lagrange’s equations and Hamilton’s principle 


value that depends upon the initial conditions; such dynamical systems are 
named conservative systems. Note that the adjective conservative is used in 
two contexts: a conservative system involves conservative forces, but conser- 
vative forces that depend upon time do not normally produce conservative 
systems. 


For a conservative system with one degree of freedom the expression for 
the total energy gives a first-order differential equation which replaces the 
second-order equation of Newton, which generally makes the analysis easier: 
an example of this process is found in Exercise 4.1. The energy E can be 
obtained by integrating the equations of motion once, so it is often referred 
to as an integral of the motion or, because it is constant, as a constant of 
the motion. 


dese eames nee a SD ee OE De ee ei, aE eee) vce hing: 
Exercise 4.17 


(a) Consider a particle of mass m moving vertically upwards from the surface of 
the Earth, with height z above the surface. Use the potential energy defined 
in equation (4.68) to show that the total energy is 

. GMm 

R+z- 

(b) The escape velocity is defined to be the value of ¢ when z = 0 such that as 

z— co, z — 0, that is the particle can just escape the gravitational field: any 


smaller velocity results in it returning. Use the fact that E = constant to show 
that the escape speed is 


_ [2GM 
t= a 


(c) Use the values given in the following table and the fact that G = 6.673 x 1078 
cm°g~1s~? to find the escape speeds and the values of g using the formula 
found in Exercise 4.15, for the Earth, the Moon and Jupiter. 


| | Earth [Moon | Jupiter 


M/s | 5.972 x 10°" | 7.250°« 107 | 1.890 x 10° 
R/cm | 6.378 x 10® | 1.738 x 108 | 7.190 x 109 


Exercise 4.18 
Show that for a single particle with mass m, moving in a force F(r), Newton’s 
equations of motion give the rate of change of the kinetic energy, T, to be 


dT 
ae ee 
dt ‘ 


—_—_—_—_—.. see Oo 
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4.4 Lagrange’s equations and 
Hamilton’s principle 


In this section we introduce Lagrange’s equations of motion. For holonomic Holonomic systems are 
systems, these are equivalent to Newton’s equations, but are formulated di- defined on page 164. 
rectly in terms of the generalised coordinates; they avoid use of constraints, 

except in the initial definition of the coordinates, so are far easier to de- 

rive. Lagrange’s equations can be derived from Newton’s equations, but the 

analysis is fairly complicated and distracts from the main story. 


Lagrange’s method can be applied to any holonomic system of particles 
that interact through conservative forces. The configuration of the system 
is defined by n generalised coordinates q = (q1, 92,---;Qn), So the potential 
energy is a function, V(q,t), of these variables and possibly the time. The 
kinetic energy, 7’, is a quadratic function of the generalised velocities q = 
(1, 42,---,Qn). Then the Lagrangian is defined to be the difference between 


the kinetic and the potential energies, 


L(q,4,t) = T(q,q,t) — V(q, ¢). (4.73) 


The Lagrangian is a function of the generalised coordinates and velocities, 
and possibly the time; it is sometimes referred to as the Lagrangian function. 
Notice that the Lagrangian has the units of energy. 


Lagrange’s equations of motion depend upon the derivatives of the La- 
gerangian. If there are n generalised coordinates, there are n equations of 
motion, which are 


d {OL OL 
eee piss goer k =1,2,...5n. 4.74 


You will recognise these as the Euler-Lagrange equations for n dependent 
variables, derived in Chapter 3 — specifically, equation (3.71) (page 108) — but 
without the boundary conditions. This similarity suggests that Lagrange’s 
equations of motion can be derived from a variational principle, and we shall 
describe how this is done later. 


In simple cases it is easy to see that Lagrange’s and Newton’s equations are 
the same. For instance, a free particle (that is, a particle acted upon by no 
forces) with Cartesian coordinates r = (2, y,z) has no potential energy, so 
the Lagrangian is just the kinetic energy, 


L=4m(é?+9° +2). (4.75) 


In this example Cartesian coordinates are the most convenient generalised 
coordinates. Lagrange’s equation of motion for the x-coordinate is then 


ay ds 0 1V1 (=o (4.76) 
—|{|—]= ivin : 

dt \ a4 ye a ere 

which is Newton’s equation. The algebra for the remaining two coordinates 
is identical, and this leads to mr = 0. It is important that you do the next 
exercise. 


4.4 Lagrange’s equations and Hamilton’s principle 


ee 


Exercise 4.19 


For a single particle moving in a conservative force, with potential V(r,t), the 
Lagrangian is 


L= jm (4? +9? + 27) — V(r, 2). (4.77) 
Use Lagrange’s equations of motion to show that 
a 0 ree 0 waa. 2% 0 
ae) , — = ; i, —=— == , 
Ax - ae Az 


and show that these are the same as Newton’s equations of motion. 


Exercise 4.20 


For a particle of mass m moving in the Oxy-plane under the influence of a force 
always directed towards the origin, the Lagrangian is 
L=5m(¢?+y?)-V(r), r?=a2%+4+y?. 


(a) Use Lagrange’s equations of motion and Cartesian coordinates to show that 
the equations of motion are 


mé + V'(r)= =0 and mij +V'(r)2 =0. 
r 


(b) Show that with polar coordinates, (r,@), where x = rcos@ and y = rsin@, the 
Lagrangian becomes 
L = hm (i? +16?) — V(r) 
and that the equations of motion are 


2 


A 
mf ——>+V"(r)=0 and @=—,, where A is a constant. 
mr mr 


Lagrange’s equations of motion, defined in equation (4.74), are more elegant 
than the equivalent Newtonian equations because their form is independent 
of the choice of generalised coordinates, recall the discussion in Section 3.2.2, 
in particular equations (3.28) and (3.29) (page 101). The underlying reason 
for this is that they can be derived from a variational principle, which, as 
we saw in the previous chapter, is invariant under coordinate changes: this 
variational principle can be used as the fundamental principle of Newto- 
nian dynamics, in place of Newton’s three laws. This change of emphasis, 
from Newton’s laws to a variational principle, may seem an unnecessary 
complication but it is an essential step for later developments in advanced 
dynamics and theoretical physics. In addition, in the spirit of Ockham’s 
razor, it provides a single, simple principle from which everything follows. 


Hamilton’s Principle 


The trajectories between two points q = a and q = b in configuration 
space, starting and ending at times t, and ty, are given by the stationary 
paths of the functional 


/ Tenaga ea (4.78) 


The functional S is important in advanced dynamics (and in quantum me- 
chanics). Hamilton used it to draw important analogies between particle 
dynamics and optics, which were important in the development of quantum 
mechanics in the early part of the 20th century. Because S' is important, 
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it is called the action; it has the units of energy x time, that is, dimensions 
ML?T~!, which is the same as angular momentum. 


It follows from the theory described in Chapter 3 that the path between 
the end points is given by the solution of the associated n Euler-Lagrange 
equations, 


dt \ 0g 
These coupled, second-order equations are called Lagrange’s equations of 
motion, because Newton’s equations were first cast in this form by Lagrange; 
the full account of this theory was published in Mécanique analytique (1788). 
A short history of the development of these ideas is given at the end of this 
section. 


L 
d € )-x-9 Sa 2) =s, (212 4 


If L does not depend upon a particular generalised coordinate, q; say, the 
associated Lagrange’s equation can be integrated directly to give 


i = constant. (4.80) 
0" 

That is, for this coordinate Lagrange’s equation is a first-order equation 
and hence easier to solve. A generalised coordinate like q is named an 
ignorable coordinate. As an aside, we note that this is an illustration of 
Noether’s theorem (page 120) in operation; see Exercise 4.54. One aim in 
choosing generalised coordinates is to obtain as many ignorable coordinates 
as possible, because this helps to find solutions. An example is given in 
Section 4.5.1. 


Note that if the time interval |t, — t,| is sufficiently small it can be shown 
that the stationary path is an actual minimum of the action. The reason for 
this is that for these times the kinetic energy term dominates the integral 
and the motion is close to that of a free particle; see Exercise 4.49. 


Apart from the advantages discussed above, you will recall that part of 
the original motivation for developing Lagrange’s equations is the ease with 
which constraints are included. For instance, consider, the bead constrained 
on a rigid wire in the shape y = f(x): in Section 4.2.1 we dealt with the 
case f(x) = ax?/2. The potential energy is V = mgf (zx), and the speed is 


vad? tg? = 2" (1+ f'(z)*) since 9 = f'(a)s, (4.81) 
so, with x chosen to be the generalised coordinate, the Lagrangian is 

L = sma" (1+ f'(x)*) — mgf (z). (4.82) 
Lagrange’s equation of motion is then 

a ¢, ; 

< (4 (1+ f'(a)?)) - 7 F'@)F"(a) + of (a) = 0. (4.83) 


You may like to compare the efforts needed to derive this and equation (4.5) 
(page 147), where f(x) = ax?/2. 


4.4 Lagrange’s equations and Hamilton’s principle 


Exercise 4.2] 


Show that with f(a) = ax*/2, equation (4.83) expands to equation (4.5) (page 147). 


Exercise 4.22 


Two particles, of masses m; and mg, are joined by a light inextensible string passing 
over a smoothly turning pulley of radius R without slipping, as shown in Figure 4.13. 


(a) Assuming that the mass of the pulley is sufficiently small for its energy of 
rotation to be neglected, show that the Lagrangian for the motion of the system 
is 


Eis, 2) = 5 (my + mg) 27 + (my — me) gz. 
(b) Ifthe rotational kinetic energy of the pulley is T, = i M,R?6?, where @ is the 


angular velocity of the pulley and M, is a constant, show that the Lagrangian 
becomes 


L(z,2) = $Mz? +(mi—me) gz, where M =m, +m2+ M,. 
(c) Solve the equations of motion for the case described in part (b). 


Exercise 4.23 


(a) A particle of mass m moves under the influence of gravity along the smooth 
spiral wire defined, through the parameter @, by 


f=ncoso, wooded, «= ke, 


where a and k are positive constants and the z-axis is vertically upwards. 
Obtain the Lagrangian and the equations of motion, and solve these to find 


g(t). 


(b) If, instead of being fixed, the spiral rotates about the z-axis with angular 
velocity 2 = 6, the parametric equations of the curve become 


x=acos(@+O(t)), y=asin(d@+O(t)), z=k¢. 


Find the Lagrangian and the equations of motion in this case. 


Exercise 4.24 


In this exercise we use Hamilton’s principle to find an approximation to the solution 
of Lagrange’s equations, rather than to derive the equations of motion. 


Consider a particle of mass m moving along the z-axis in the potential V(x) = 
ma*x*/4, where a is a positive constant. It can be shown that all solutions of the 
equation of motion are periodic. 


(a) Show that the action for a T-periodic solution is 
i 
S\x| = m | dt (54° —4a°x*), x(0)=A, 2(T)=A, 
0 


for some A. 


(b) By approximating a periodic solution with angular frequency w and amplitude 
A with the function x = Asinwt, show that the action is approximately 
ammA? 
16w 
You will need the integrals 


27 27 
2n-— Ly 2n — 3)...3.1 
/ du cos u = | du sin?” u = 27 (Ges Dien 3) 8 
0 0 2n(2n — 2)...4.2 


S{(A) = (8u* — 3a” A?) . 
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(c) Hamilton’s principle requires that S|] is stationary: hence an approximation 
to the motion is obtained by choosing A so that S(A) is stationary. This will 
give an approximation to the amplitude of the motion with frequency w. 


By finding the solutions of S’(A) = 0, show that the amplitude and frequency 
are related approximately by 


Note that the exact relation between the amplitude and frequency is A = 
1.1803w/a, so the approximation has a relative error of about 2%: the exact 
solution is, of course, a more complicated function of t than the approximation 
used here. 


Conservation of energy 


If the Lagrangian does not depend explicitly upon the time t, then it may 
be shown using Noether’s theorem (page 120) that there is an integral of 
the motion, equivalent to the first integral derived in Exercise 2.6 (page 51) 
and used in Sections 2.5 and 2.7. This derivation is not an assessed part of 
the course, but the interested student should compare the functional (3.132) 
(page 120) and (4.73). We see that x and y corresponds to ¢ and q and also, 
in the equivalent of equation (3.131) ¢ = 1 and yy, =0,k =1,2,..., n. The 
first integral, equation (3.133), is 


ge -L=E, (4.84) 
k 


where F is a constant, which is normally the energy. This expression is 
usually referred to as the energy integral. 


Consider the simple Lagrangian defined in equation (4.77), for which gq; = 2, 
g2 = y and q3 = Z, in the case where the potential energy is independent of 
t. Then 


5 
So dea =m (4? + 9? + 2) =2T. (4.85) 
a. 


Hence & = 2T —-(T—V)=T+V so, in this case, E is the total energy. 


Exercise 4.25 
Show that the constant of motion associated with the Lagrangian (4.82) is 
E = Sma? (1+ f'(x)?) + mgf (a). 


Use this to find an expression for « in terms of z and LE. 


Addition of a total derivative 
If L and L are two Lagrangians related by 


La.4.t) = La.4.t) + “f(a,), (4.86) 


where f(q,t) is any differentiable funtion of q and t, but not the generalised 
velocities, then Lagrange’s equations of motion for L and L have the same 
solutions. 


4.4 Lagrange’s equations and Hamilton’s principle 


This result follows directly from the variational principle, because we have 


a ty = ty, df 
5= | w= f dt (i+) 
= S+ f(b, ty) — fla, ta). (4.87) 


The functionals S and S differ only by a constant, so a stationary path of 
S is also a stationary path of S, which proves the result. 


This result is often useful when the transformation between generalised and 
Cartesian coordinates involves the time: for example, in the case of a bead 
on a moving wire (page 148), we have y = f(x) + 7(t) and the speed is given 
by 

rae ty aH st (fi (c)t+7)’, (4.88) 
and the Lagrangian (4.82) becomes 


L = yma" (14+ f'(x)”) + méf! (x)4¥ — mgf (x) — mgy(t) + $m*. (4.89) 


Now observe that #f’(x) = df /dt and that f+ = d(f+) /dt — f(x)¥. Hence, 
the Lagrangian may be written in the form 


L = 5mé? (1+ f'(x)*) —m(g + Hf) 
+m] (f@)4) — alt) +47]. (4.90) 


The square brackets enclose three terms all of which are the derivatives of 
some function of the time so may be ignored. Hence the Lagrangian for this 
system is 


L = 5ma? (1+ f'(z)*) —m(g +4) f(z). (4.91) 


The only difference between this Lagrangian and that defined in equa- 
tion (4.82) is that g is replaced by g + ¥. 


Exercise 4.26 


If f(x) = ax?/2, show that the equation of motion given by the Lagrangian (4.91) 
is the same as in equation (4.8) (page 148). 


Exercise 4.27 


In this exercise we derive the Lagrangian for a particle of mass m moving on a 
rigid wire in the vertical plane, with shape y = f(x), where the y-axis is vertically 
upwards, as in the text, but using s, the distance along the wire, for the generalised 
coordinate. 


(a) If s is the distance along the curve from some fixed point, show that the 
kinetic and potential energies are T = m8?/2 and V = mgy(s), and hence that 
the equation of motion is § + gy’(s) = 0. 


(b) Consider the cycloid discussed in Section 2.7.1 and shown in Figure 4.14. 


This is a hard exercise. 
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Figure 4.14 Diagram of a bead sliding on a wire in the shape of a cycloid. 
The distance along the curve from O to P is denoted by s. 


The position of a particle at P, moving between the cusps at A and B, is 
defined uniquely by the distance, s, along the curve from O; alternatively, the 
coordinates of the point are defined by the parametric equations x = a(@ + 


sin), y = a(1 — cos ¢), || < 7. 


Show that s = 4asin(¢/2), where s is zero at the minimum, and y = s?/(8a) 
and hence that the equation of motion is 


d?s gq 
—+—s=0, |s/ < 2a. 
dt? is Aa is] s 
(c) Write down the general solution of this equation and show that the particle 
oscillates with a period that is independent of the amplitude of the motion. 


Find this period. 


Deduce that the time to reach the bottom of the cycloid, starting from rest at 
any point between O and B is the same. 


Exercise 4.28 


Consider the pulley system illustrated in Figure 4.9 (page 163): show that the 
Lagrangian for the particle of mass m is 


L = 3(M +4m)y? + $(2m — M)gy, 


where y is its distance below the ceiling. Use Lagrange’s equations of motion to 
obtain the solution 


2m—M 
= A+ Bip 
J ” 2 


where A and B are constants. 


gt’, 


A short history of Hamilton’s principle 


The development of the calculus of variations during the 18th century was 
complemented by contemporary work in physics, namely the Principle of 
least action. ‘This, in turn was motivated by the observation, by Euclid, 
that light rays reflected in a mirror travel the shortest path, as described 
in Section 1.5.5 (page 23), which led to Snell’s law and Fermat’s principle 
(published in letters dated 1657 and 1662). 


Maupertius when working on the theory of light proposed, in 1744, his 
Principle of least action which he developed from Fermat’s principle. The 
original formulation is rather vague, simply stating that the action is the 
product of the mass, speed and distance travelled and that this is a mini- 
mum along an actual path. Euler, also in 1744 after correspondence with 
Maupertius, formulated the same law a little more precisely by noting that 
the rate of change of the integral [ dt v* for a change in the path must be 
zero, though the precise meaning of this statement is not clear. 


Euclid, ca. 325-265 BC. 


Pierre-Louis Moreau de 
Maupertius, 1698-1759. 


4.5 Applications of Lagrange’s equations 


Lagrange clarified the situation by defining the action to be the integral of 
the kinetic energy. Thus for a single particle the action was defined to be 


t2 
W= [ aT, T=3m(s*+y° +2). (4.92) 
Lagrange assumed that energy was conserved, T+ V = E, so dealt only 
with conservative systems. Lagrange’s principle of least action states that 
W must be a minimum or a maximum along an actual path joining two 
given points in configuration space, at times t; and tg. With this princi- 
ple the admissible paths have the same energy as the actual path, so this 
variational principle is different from those dealt with in Chapter 2, because 
the admissible paths are constrained by an additional equation. Using this 
principle Lagrange was able to derive Newton’s equations of motion. 


Lagrange made the further step of introducing the generalised coordinates 
and showed that, for conservative systems with n degrees of freedom, New- 
ton’s equations of motion can be written in the form 


@ for of oY 
——— ee — ee —= — 2 re er ° bg 
: (5) @ a) eo. (4.93) 


This analysis was described in Mécanique analytique (1788), but first sug- 
gested in Miscellanea Taurinensia II (1760). 


The next major development was due to Poisson. In 1809 he defined the 
function L = T —V, that is, the Lagrangian function introduced in equa- 
tion (4.73) (page 174), and noted that if V depends only upon the gener- 
alised coordinates (that is, neither the time nor the generalised velocities) 
the equations of motion of Lagrange can be written in the form given in 
equation (4.74). He also introduced the generalised momenta, defined by 
pr = OL /Ogx, k = 1,2,...,n, which play a major role in later developments 
of the theory, but do not occur again in our story. 


Hamilton, in 1834, introduced the action integral defined in equation (4.78), 
but his theory was not limited to conservative systems. Hamilton’s principle 
reduces to the original variational principle of Lagrange when (a) the system 
is conservative, and (b) the kinetic energy, 7’, is homogeneous of degree 2 
in the generalised velocities. That is, for any constant \, T(Aq) = A°T(q), 
which is the case for many mechanical systems. Hamilton also demanded 
only that the action was stationary along the actual path. In his 1835 paper, 
Hamilton introduced the Hamiltonian function and set the scene for modern 
developments. 


4.5 Applications of Lagrange’s 
equations 


In the following three sections we show how Lagrange’s equations are derived 
for some typical systems with one or two degrees of freedom. By comparing 
this analysis with the equivalent analysis in Section 4.2, you will see how 
much simpler Lagrange’s method can be. In the last example, Section 4.5.4, 
we show how the theory is extended to describe the partial differential equa- 
tion for transverse vibrations of a taut string. 


As you read through these applications you should be able to discern a sim- 
ple pattern, easiest to describe for a particle moving in a Cartesian plane 


isl 


The kinetic energy was 
originally named, by 
Leibniz (1646-1716), the 
visa viva, or living force, 
and this name was in use 
until the last half of the 
19th century. 


Siméon-Denis Poisson, 
1781-1840. 
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Oxy, but constrained in some way, so there is only one generalised coordi- 
nate, q. In this case, the position of the particle is defined by two functions, 
x = f(q,t) and y = g(q,t), and everything follows from these functions. For 
instance, the potential energy will be some function of x and y, and hence g. 
The kinetic energy is proportional to the square of the speed, v? = «? + 9’, 
and since x = fgq+ ft and y = ggg +, the kinetic energy is given directly 
in terms of q, q and t. Hence, the Lagrangian is obtained directly from the 
functions f(q,t) and g(q,t). If there are more particles the method is exactly 
the same. As you read through the following sections, this pattern to the 
analysis should be noted. 


4.5.1 The simple pendulum 


We now return to a discussion of the simple pendulum, dealt with previously 
in Section 4.2.2, but first provide some general comments to explain why this 
is an important problem. 


Some general comments 


The simple pendulum is important for both historical and practical reasons 
and here we provide some background to explain why this is. First, it is 
worth noting that in 1889 a bibliography of pendulum papers was published 
by the Société Frangaise de Physique: Mémoires sur le Pendule (Gauthier- 
Villars) contains a list of papers on the pendulum, published between 1629 
and 1885 that runs to over 1300 items, many of which are concerned either 
with the construction of an accurate clock or with geophysical measure- 
ments. 


Equations of motion similar to that of the pendulum occur in a variety of 
important problems. One is when a periodic driving force shakes a me- 
chanical system at a frequency close to its frequency of natural vibrations 
and resonances occur. Another important, mathematically similar, system 
is found in the hindered rotations of certain types of molecules. An exam- 
ple is CH3CF’3, the molecular structure of which is shown schematically in 
Figure 4.15. 


F 
Figure 4.15 Diagram showing the structure of the CH3CF3 molecule. 


In this molecule the end groups, CF3 and CH3, can each rotate rigidly 
and independently about the common C-C axis. The torque between the 
two groups clearly depends periodically upon the relative angle between the 
groups. It transpires that this relative rotation is dynamically similar to 
that of the simple pendulum considered here. 


Historically the simple pendulum was of crucial importance in the develop- 
ment of Newtonian dynamics and it was studied and used extensively by 
Galileo and Newton, see Exercise 4.30. It is one of the simplest mechanical 


4.5 Applications of Lagrange’s equations 


systems amenable to experiment and accurate measurement, so it is perhaps 
surprising that it was not studied by the Greeks. Moreover, it appears that 
there was little quantitative knowledge of the pendulum before the time of 
Galileo and his contemporaries. 


It was Galileo, however, who discovered all the major properties of the 
pendulum. It is reported that in 1581, as a medical student in Pisa, his 
observations of a swinging lamp, in Pisa Cathedral, led him to believe in the 
isochrony of the pendulum. This was later confirmed experimentally and 
stated in a letter in 1602; here he noted that the period was independent of 
the mass, and also the veracity of the result found in Exercise 2.18 (page 66). 
Only much later, in 1638, did he show experimentally that the period was 
proportional to the square root of the length. This result was also stated 
by Mersenne in 1635 and the Italian physicist Baliani in 1638, although at 
this time the relevant length of the pendulum was not clearly defined. Of 
greater significance, however, Galileo used his observation of the pendulum 
to show that the, now unfamiliar, Aristotelian doctrine of ‘mutually exclusive 
motion’ was conceptually flawed and hence opened the way to the modern 
view of motion. Here is not the place to elaborate on this problem but a full 
account is given by Ariotti, which gives some idea of the intellectual effort 
required to overcome conventional wisdoms. 


After Galileo, the pendulum became a tool for regulating clocks and, through 
the simple formula for the period, T = 27,/1/g, an instrument for determin- 
ing the local value of g and hence properties of the Earth. The first major 
step in clock design was due to Huygens, as described in Section 2.7.1. From 
1656 onwards Huygens’ invention brought significant improvements in the 
accuracy of time keeping. The first pendulum clock was accurate to about 
30 seconds per day and the last, the Shortt pendulum clock (1921) was ac- 
curate to a one second per year; this was used in the Greenwich Observatory 
from 1921 until 1942, when it was replaced by a quartz crystal clock based 
at the Post Office research establishment in North London. The BBC ‘pips’, 
first started in 1924, were controlled by a pendulum clock until 1949. 


Derivation of the equation of motion 


We now return to our main pursuit, which is to derive, once more, the 
equation of motion for the pendulum. The idealised pendulum comprises a 
heavy particle, P, of mass m, firmly attached to a weightless, stiff, inexten- 
sible rod of length /. The other end of the rod is attached to a fixed point 
O, about which it can swing freely. Such a system is shown schematically 
in Figure 4.16. 


mg 


Figure 4.16 Diagram showing the idealised pendulum of length /, with smooth 
hinge at O and particle of mass m at P. The coordinate axes are shown with Oy 
vertically upwards. 


As before we assume that there are no constraints on the value of 6, the 
angle between OP and the downward vertical. 
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The derivation starts by finding expressions for the position vector and speed 
of the particle P in terms of the generalised coordinate 9. The Cartesian 
coordinates of the position are given in equation (4.9) and for convenience 
we repeat them here: 


x=Ilsin@d and y= -—lIcos@, (4.94) 
where we use the same coordinate system and notation shown in Figure 4.16. 


The potential energy is V = mgy, and in terms of the generalised coordinate 
this becomes 


V(@) = —mgl cos 0. (4.95) 
The velocity of the mass is (%, y), which, on differentiating equations (4.94), 
becomes 

¢=10cos@ and y=l6sin6, (4.96) 


so its speed is given by v? = #? + y? = 1262, and hence the kinetic energy is 
T = mv? = 4ml?6". (4.97) 
Thus the Lagrangian function is 
L£= i=). = imi?" + mgl cos 6. (4.98) 


Notice that all we need for the derivation of the Lagrangian is the position 
of the particle in terms of the generalised coordinate, equation (4.94), and 
its time derivative, equation (4.96). 


Since OL /00 = ml26 and OL /00 = —mglsin 6, the equation of motion is 
J 
l 


which is the same as equation (4.13). You may like to compare the efforts 
needed to derive this and equation (4.13) (page 150). 


6+ sind =0, (4.99) 


Exercise 4.29 


(a) Show that the first integral of the motion, equation (4.84), for the pendulum 
is just the total energy, 


Eel +V = tml? 6? — mgl cos 8@, 


and that the minimum value of the energy is E = —mgl, corresponding to the 
pendulum hanging downwards, 0 = 0, and stationary, 0 = 0. 


(b) By putting E = —mgl(1 — 2k?), k > 0, show that the energy equation becomes 


4 
@ = — (k? — sin?(6/2)). 
Deduce that if 0 <k <1, the value of @ is restricted to |6| < a < 7 where 
sin(a/2) = k. 


If the pendulum starts at 6 = —a with 6 = 0, show that the time, 7, to reach 
6 = a is given by 


ae a 1 
_ Jaf Ose" Foe sin 5). 
GJ—-a  4/k2 — sin?(6/2) 
Note that 7 is half the period. 


(c) Define a new variable ¢ by the equation sin(@/2) = ksind to show that the This is a slightly harder 
period, T’ = 27 can be written in terms of the integral calculation which you 
should do only if time is 


2 i 1 ilable. 
T =4,/- / dé —————,_ k = sin(a//2). ian 
g Jo 1 — k? sin? 


4.5 Applications of Lagrange’s equations 


Express the integrand as a power series in k using the binomial expansion 
and the integrals given in Exercise 4.24 (with appropriate adjustments to the 
limits), to obtain the approximation 


l 
T= ant (1+ gk? + Sk* + O(k*)). 


Exercise 4.30 


Use the energy equation derived in the previous exercise to show that if the pen- 
dulum mass is released from rest at Po, the speed of the mass at the bottom of the 
swing, # = 0, is proportional to the distance d = PPo, shown in Figure 4.17. 


The above result allows an indirect, accurate measurement of velocity, which was 
important in Newton’s time because other methods were not available. Newton 
knew of this result, which he stated to be “a proposition well known to Geome- 
ters”, and used it in his experiments involving the collision of two pendulum masses 
prior to formulating his laws of motion. 


A moving point of support 


In the second example considered in Section 4.2.2, when the pendulum hinge 
is attached to a freely moving particle of mass M, see Figure 4.3 (page 151), 
the Cartesian coordinates and velocity of P, with respect to the Oxy-axes, 
are 


r=(X+l/sin0,—lcos@) and r= (x + 10cos 6,10 sin 0) AOD 
so that the potential energy is V = —mglcos@ and the kinetic energy is 

T = 4MX? + bm (X? + 76? + 21X6 cos). (4.101) 
Hence the Lagrangian is 

L=3(M+ wi + 1ml?6? + ml0X cos @ + mgl cos 6, (4.102) 


which depends on two generalised coordinates, X and 9. However, it depends 
upon X and not X, which is therefore an ignorable coordinate, so the X- 
equation of motion is simply 


a . 
bind = 4.103 
7 ((M +m)X + ml@ cos 0) 0, ( ) 
or 
R= A-phese, «tus (4.104) 
— [ COS ? bt i, M+m’ i 


where A is a constant. On differentiating this with respect to t we obtain 
equation (4.20) (page 152). 


On dividing by ml, the equation for @ is 


< (16 + Xcos@) + Xdsind + gsind = 0 (4.105) 
which, on using the above expression for X , becomes 
(1 - LL COS” 0) 10 + 16? sin 0 cos @ + gsin@ = 0, (4.106) 


which is just equation (4.21). 


In this case the equations of motion simplify because X is ignorable. Us- 
ing the expression for X in the energy integral (see equation (4.84) and 


P 


Figure 4.17 
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Exercise 4.31) we obtain 
g m 


2 (1 — wcos” 8) 6° — = cos 6 = B, b= ne A 


where B is a constant. Since 0 < ps < 1, the coefficient of 6? is never zero. 


(4.107) 


Exercise 4.3 | 


Show that the energy integral, equation (4.84), for the above system can be written 
in the form | 
EE 


9 292 ‘a! 
5X + Syl 6 net oo Se are 


Using equation (4.104) for X show that this simplifies to equation (4.107). 


Exercise 4.32 
If the initial conditions at t = 0 are 0 = 0, 0 = 0, X = X =0, use equations 4.104 
and 4.107 to show that 
., 4g sin?(09/2) — sin?(0/2 
X(t) = pl(sin®y9 —sin@) and 67 = -- ee a 


If |09/ < 1, show that the approximate solution is 


A(t) = Aj coswt, X(t) = 2ylOpsin*(wt/2), where w= aa 


4.5.2 The double pendulum 


For the double pendulum, we saw in Section 4.2.3 that the two generalised 
coordinates can be taken to be the angles 6; and 62 defined in Figure 4.4 
(page 153). The derivation of the Lagrangian equations of motion for this 
system starts by deriving expressions for the Cartesian coordinates, (21, y1) 
and (x2, y2) for each particle, in terms of the two generalised coordinates. 
This analysis is given in Section 4.2.3, but for convenience we repeat the 
relevant equations here: 

Ly, = ly = 01, | y= ly COS Gy. (4.108) 

x2 =l,sinO; +losin@2, yo = —l, cos@; — le cos Oo. 
The total potential energy is the sum of the potential energy of each particle, 
V =migyi + mzgy2; in terms of the generalised coordinates, this is 


V (64, 05) = —mygl, cos 61 — mag (14 cos 6; + lp cos 2) : (4.109) 


The speed, v1, of P; is given by the formula uv? = #7 + 9? = 176?, where we 

have used equations (4.12) (page 150). The expression for the speed v2 of 

the second particle is more complicated, though derived in the same manner: 
2 


v2 = 43+ 9% 
; 2 ; 9 
— (11 cos 6; + 1262 cos 02) + (116 sin 6; + I> sin 0> 
== 126? “ 214150105 cos(6; = Q2) ah 1262, (4.110) 


where we have used equations (4.26) (page 154) for zg and wy. Thus the 
kinetic energy of the system is T = (m,v? + mgv3)/2, which becomes 


, = 5l¢(m4 sf mz2)0° + 4137m265 + Mo11190105 cos(6; — 65). (4.111) 
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This is a hard exercise. 


4.5 Applications of Lagrange’s equations 


Hence the Lagrangian function, L = T — V, is 
bs sz (m4 + m2)67 + mal1l261 6 cos(@; — 62) + 415m203 
+ (m1 + m2)gly cos 61 + mgglzcos62. (4.112) 
The equations of motion are derived from this using Lagrange’s equation of 


motion, equation (4.79). The simplest is that for 62, so we derive that first. 
Using the relations 


OL i 

b> = —meaglo sin Q> + mol,101 45 sin(6; _ 2), (4.113) 
2 

OL : ° A 

Abe a Magl5 Oo + mol, 1204 cos(6; — 02). (4.114) 
2 


Notice that the product mgl2 is a factor of both these equations and hence 
does not appear in the equation of motion. Thus Lagrange’s equation for 65 
is 


| oe 2 
dt (126 + 1,6; cos(@1 — 00) + gsin 42 — 116162 sin(#1 — 62) = 0. (4.115) 


After expanding the time derivative, this simplifies to 
116; cos(9, — 82) — 1,0? sin(@; — 02) + 1562 + gsin @ = 0, (4.116) 
which is the same as equation (4.30). 


The other equation of motion, for #;, is obtained in the same way, although 
it is more complicated. The basic partial derivatives are 


OL 


56, = —mal4 126102 sin(@1 = 02) aa (my + m2) gly sin 01, (4.117) 

1 

AL, 2; : 

—s (my + ma)126, + malyl262 cos(61 — 09), (4.118) 
1 


and we see that Lagrange’s equation for 6, is 


d 
ai (m1 + mg) 1701 + malyl262 cos(O1 — 00)) 


+— Mol1106165 sin(), an 02) i (my a m2) gly sin 0; = Q, (4.119) 


which, on expanding the first bracket and dividing by /1(m + m2), simplifies 
to 


ms @ cos(4, — 05) + 93 sin(01 a 02) a gsin 0; = 0, 
2 


ie 
191 Gace 


(4.120) 
which is the same as equation (4.40). 


The equations of motion, equations (4.116) and (4.120), are second-order 
nonlinear coupled differential equations which do not have any solutions that 
can be expressed as finite combinations of known functions. Indeed, under 
some circumstances the solutions display chaotic behaviour. The only limit 
in which simple solutions can be found is when both pendulums execute 
small oscillations: then the equations of motion can be approximated by 
coupled linear equations, which can be solved. ‘These equations are derived 
in the next exercise. 
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Exercise 4.33 


(a) Show that if |@;| and |@;|, i= 1, 2, are all small, so that second-order terms 
may be neglected, then the equations of motion (4.116) and (4.120) simplify 
to the pair of coupled linear equations 

m2 


1,6; i lo 05 > gA2 =e (, L601 5 pl20> + g6, =0, where a —— 
it, 


You will need the approximations sinxz ~ x and cosx ~ 1 for small |z]. 
(b) Substitute the functions 6; = A; coswt and 62 = A2 cos wt into the above equa- 


tions for the case 1; = l2 = 1, and show that two solutions are 


1 
o; =acosat, -G2= ras coswt, where w* = J 
m Se 


a 


4.5.3 A simple pendulum with moving support 


In this section we derive the equation of motion for a simple pendulum with 
a point of support that is oscillating vertically. This problem is interesting 
because it is one of the simplest examples showing the effects of vibration on 
a mechanical system. In addition, there are circumstances where the solu- 
tions behave counter-intuitively. Similar effects are seen in the interactions 
of powerful lasers and atoms. 


The system is shown schematically in Figure 4.18. 


Figure 4.18 Diagram of a simple pendulum with a vertically moving hinge. 


There is a particle P, of mass m, attached to the point A by a light in- 
extensible rod of length /. The frictionless hinge A is moving vertically, so 
its distance below a fixed point O is y(t), a known function of time. The 
Lagrangian of this system is, as normal, determined from the position coor- 
dinates of P with respect to a fixed reference frame, expressed in terms of 
the generalised coordinate 6. 


We use the axes Oxy, shown in the figure, with the y-axis vertically up- 
wards. With respect to the moving point A, the coordinates of P are 
(lsin 8, —l cos @), so in the fixed reference frame 


c=Ilsiné, y=-—y7t)—Icosé. (4.121) 
The potential energy is V = mgy. The square of the speed is 
9 ‘ 2 ; 2 949 ‘ 9 
y= (16 cos) i: (+- 16 sin 6) — 126? — 2146 sing ++. (4.122) 
Hence the Lagrangian is 


7 aml?" + mgl cos 6 — ml46 sin é + (my + mgy) (4.123) 
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4.5 Applications of Lagrange’s equations 


The first two terms of this Lagrangian are identical to those in the La- 
grangian (4.98), where the support is stationary. The third term is due to 
the motion of the support and may be simplified by noting that @sin@ = 
—d(cos 6) /dt, and hence that 


d 
= (Vcos 0) = ycos@ — 40 sin 6. (4.124) 


On substituting for 46 sin@ the Lagrangian becomes 


: Ao, 
L= smi? 6 +ml(g — 7) cos8+m sy + 9y+ = (ycos@)}. (4.125) 
Since the term in square brackets is the time derivative of a function of 6 
and t it may be ignored, to give the equivalent Lagrangian 


iz 5m?" + ml(g — 7) cos 6. (4.126) 


Thus the effect of moving the support vertically is to modify the value of g. 
For example, if the pendulum is in a lift uniformly accelerating upwards 
with acceleration a, we have 77 = —a and the effective value of g would be 
g +a, as might be expected. 


The effect is more interesting if y oscillates rapidly: for instance if y = 
asinQt, provided aQ > \/2gl (a result that can be proved using methods 
we cannot include in this course), the pendulum will oscillate stably in the 
upward vertical direction, that is, about 6 = 7, which is impossible for the 
normal pendulum. The equation of motion for this system is 


ae Q? sin Nt) sind = 0 4.127 
ae * ple sin 0t) siné = 0, (A427) 
and in Figure 4.19 is shown the numerical solution of this equation with 
units chosen to give g =1=1 and a=0.1, 2. = 40 and with the initial 
conditions @(0) = 3.0, 6(0) = 0. The value of 6(t) is shown for 0 < t < 2r, 
that is 40 oscillations of the support: we note that the motion comprises a 
long period oscillation, with period about 2.4, on which is superimposed the 
high frequency oscillations of the support. The dashed line is at 0 = 7, so 
we see that the pendulum is oscillating about the upward vertical. 


Figure 4.19 Graph of the solution @(t), showing stable oscillations about the 
upward vertical. In this example a = 0.1, 2 = 40, g =/ = 1 and the initial 
conditions are 6(0) = 3.0, 0(0) = 0. 


The general theory mentioned above shows that the low frequency oscilla- 
tions have the approximate frequency 


20? 
= 2 = ts (4.128) 


For the parameters used in Figure 4.19, this gives the period as 2.37. It can 
also be shown that some initial conditions lead to chaotic motion. 
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Exercise 4.34 


Consider the simple pendulum with its point of support moving horizontally, with 
known displacement y(t) from a fixed point. Using the notation defined in the 
previous text, show that the Lagrangian can be written in the form 


L= tml?" + mgl (cose — sin s) 


and that the equation of motion is 


d20 oe 

7) + - (sind + * cos8 a0). 
Note that this system will also oscillate about the upward vertical if y(t) is periodic, 
with sufficiently high frequency. 


Exercise 4.35 


Consider the simple pendulum with a fixed point of support, but with the length 
l(t) a known function of time. Show that the Lagrangian is 


i= iml(t)?6? + mgl(t) cos @ 
and that the equation of motion is 


d dé 


—{_ lsin 6d = 0. 
z (: a) +3 sin ) 


4.5.4 Transverse vibrations of a taut string 
(Optional) 


In this final, optional, section we complete the circle of this course by showing 
how variational principles can be used to derive the wave equation for the 
transverse vibrations of a taut string, that is, the equation derived in Block I, 
Chapter 1. The ideas presented here are important because they form the 
basis of the modern descriptions of electromagnetic and gravitational fields. 
However, the actual method we describe was first used by Johann Bernoulli 
in 1727 and repeated by Lagrange in 1759, but his emphasis was on finding 
solutions that satisfied the boundary and initial conditions. 


We start by approximating the string, initially lying along the z-axis, by 
a set of N + 2 identical particles a distance h apart, where (V + 1)h = L, 
L being the length of the string. Adjacent masses are connected by light, 
rigid rods. We assume that the ends of the string are fixed at x = 0 and L. 
In Figure 4.20 we show this approximation for N = 5, with the curved line 
depicting the disturbed string. 


we ==) L1 r2 X3 £4 Ls fear hb! = 


Figure 4.20 Diagram showing the discrete approximation to the string with seven 
particles, N = 5. 


Newtonian dynamics 


4.5 Applications of Lagrange’s equations 


The x-coordinate of the kth mass is x, = kh, with the first (k = 0) and 
last (k = N +1) particles stationary on the x-axis, so x, is constant. If the 
string has a uniform line-density p, we approximate it by locating a mass 
jt = ph at each of the moving points. 


In motion the y-coordinate of the kth particle is y, and it is assumed to 
move perpendicularly to the z-axis: it therefore has kinetic energy SUYe 
and the total kinetic energy of the system is 


N 
T=sho> @. (4.129) 


The potential energy, V is slightly more difficult to obtain. Consider the 
three particles centred at x,, shown in Figure 4.21. 


Lk—-1 Lk tit1 =< 
Figure 4.2! Diagram showing the forces on the particle at r;. 


The gradient of the kth element, between x,_; and zx, is tan6, and, by 
definition tan 6, = (Yr — yr-1)/h. We shall assume that |6;| < 1, for all k, 
so it follows that |y, — yr_1] < h. The distance between the particles at 
L,—1 and x, is, by Pythagoras’ theorem, 


h? + (yp — yea)? =k € - O(6%) ~ h, (4.130) 


since tan 0; = 6; + O(6?). Thus, to first order, the separation between adja- 
cent particles is constant; therefore the tensions in each segment are constant 
and the same, with value 7.. The force on the kth particle, in the y-direction, 


i, = ot = Tera — sO.) ee (4.131) 
T. fe 
ae PE AGpsa, Oe) rs (ead tues E (thy = tera) 5 (4.132) 


where V; is the potential of the kth particle, all others being held fixed. Now 
integrate with respect to y, (with y,_; and ypz4, fixed) to obtain 


+52 (ue — yea)’, (4.133) 
which is the potential producing the force Fy. Because V; depends upon 
Yk+1 we need to be careful not to include terms twice, hence the potential 
may be taken to be 

N 


fo D 
V= oh > ai ok. = oe (4.134) 


z3 
Ve = ah (Yai — YR) + 


As a check we can differentiate this with respect to y,, to obtain the above 
expression for Fy. 


Thus the Lagrangian, CL, is 


N 


N 

oo 

L=T-V=shp)> %- ah 2 | ean Sa (4.135) 
me k 


191 


192 Chapter 4 Newtonian dynamics 


The equation of motion for the kth particle is therefore 
: L, 
phyp, — 5, (Oe — 2yn + YR—-1) 4 oe ee © (4.136) 


If N is large, we assume that y;(t) is given in terms of a function (2, t), of 
x and t, by the relation 


O 0? 
Uy, = Hay; t), and thet go, = = sad. o5.= = (4.137) 
where the derivatives are evaluated at x = x;. Then we have 
— 2y~n + YR— 1 
Pest IRD = (lore + byt) — 2n(re.t) + mae — h.t)) 
h h2 
077 
= — 3 4+0(h 4.138 
<7 + O(h), (4.138) 
as may be seen by substituting the Taylor expansions 
O ome 
n(x +h,t) = n(x,t) + ha + 7 4 O(n). (4.139) 
Thus equation (4.136) can be written in the form 
On 2 On cs 
—— — =O = ,/—. 4.140 
aC ggt = Olhs e= [= (4.140) 


In the limit as h — 0 this gives the wave equation derived in Chapter 1 of 
Block I. 


Our goal, however, is not the equations of motion, but the variational prin- 
ciple from which these equations can be derived, because these more general 
concepts are more useful in other circumstances. In particular, it is eas- 
ier to derive the equations of motion in more complex conditions using a 
variational principle. 


For the case of N particles, the action is 


N 2 
ae 9 Yk+1 — Yk 
— ae rao eC 7 este <. STE) 9 . 
Sly] a dt »> i 7 ( ; | (4.141) 


with 
y(0)=yo and y(r)=y1, (4.142) 


where y(0) and y(7) are the initial and final shapes of the string. We now 
approximate this using the function n(x, t), introduced above, to obtain, 


© a || oe On 
on balls a 
=f] any |o (32) 7. (Bt 


where the kth term of the sum is evaluated at z,. But an integral can be 
defined as the limit of a sum, 


+ O(h*), (4.143) 


[ dz fiz) = = lim Som it Se =e, N+ Deo. (4.144) 
0 


Comparing this equation with the integrand in equation (4.143), we see that 
in the limit as h — 0, that is, as N — ov, the functional becomes the double 
integral 


2 5 [a fas E @) _T. (3) | (4.145) 


with the boundary conditions given by 


(0,1) = 9b, ff) =O for all ¢, (4.146) 


4.5 Applications of Lagrange’s equations 


and the initial and final conditions given respectively by 
n(z,0)=m(x), nv,7)=m(x) forO<a<L. (4.147) 


Notice that the first set of boundary conditions are the end conditions, 
corresponding to yo = yni1 = O for all t. This integral is the difference 
between the kinetic and potential energies of the string, 


a aa —i = 
T= 7 dx p (3) and V iT. | dx (3) (4.148) 


Equation (4.145) gives the action for a string, and the functional has be- 
come a double integral: it is not, therefore, clear how to obtain the wave 
equation using the fact that S is stationary. In fact, we use exactly the 
same technique. That is, we evaluate the action on a neighbouring accessi- 
ble path n(a,t) + €g(a,t): because this a possible path it fits the boundary 
conditions, so g(z,0) = g(z,7) = 0 and g(0,t) = g(L,t) =0. The Gateaux 
differential, equation (2.24) (page 48), is 


ee On dg _ p On dg 


We now proceed exactly as in Chapter 2, but a little bit more care is needed 
and it is convenient to treat each term separately. 


For the potential energy term we have 


T Z, On Og On L L On 
t — —— Al 
[eG a me) = fe ae I ac a ile 


where we have used integration by parts to rewrite the z-integral. But 
g(0,t) = g(L,t) = 0 for all t, so the first term on the right-hand side is zero 
and 


‘a L 
[a (| dix a) = - [af dv" g (4.151) 
0 0 Ox Ox 


For the kinetic energy integral we change the order of integration to obtain, 
by the same method, 


ZL T On Og L On (2 0? n 
aes ee ee Al 
/ axo( | dt ae) / deo(|5ra] [ dt 2 9 a) 52) 


and since g(x,0) = g(x,7T) = 0, this gives 


T L On Og 
4. 
[a(| dx p ae) = - fra fe dep Ny (4.153) 


Thus the Gateaux differential becomes 


2 2 
--[ at [ dx (> ae Ft | 9 (4.154) 


On a stationary path AS = 0 for all g(x,t) and using a version of the fun- 
damental lemma of the calculus of variations (not proven here), we find 
that 
An + 0?n 
POR © Ox? 
which is just the wave equation. Notice that in this derivation we did not 
make the assumption that the density was the same along the length of the 
wire, so this equation is valid if the density, p, depends upon 7. 


= 0, (4.155) 
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The general form of a functional depending on two independent variables x 
and t will be 


5 L 
Sig) = / at | dx F (x 4G, Gicay) , (4.156) Recall that 7,, = 0n/Ox 
0 0 and 7, = On/dt. 


where the integrand depends upon the unknown function, (x,t), and its 
first partial derivatives with respect to x and t. There will also be boundary 
conditions given by 


m0,t) = 9b.) =O tew-all ¢, (4.157) 
and initial and final conditions given respectively by 
nc, 0) = mle), (2,7 P Sai “tore 2 <8. (4.158) 


The required function(s) make this functional stationary and it can be shown 
that these satisfy the partial differential equation 


O (OF O OF OF 
a ee ee 4.159 
at (=) * Op (=) aed oe 


This is the Euler-Lagrange equation for the functional (4.156). 


Exercise 4.36 


Use equation (4.159) to derive (4.155) from (4.145). 


Exercise 4.37 


A string with linear density p(x) is embedded in a light, uniform rubber sheet such 
that in equilibrium it lies in a straight line, along the z-axis, with end points fixed 
at x = 0 and x = L, under tension T.. It is free to make small oscillations in the 
y-direction, perpendicular to the x-axis, and in the plane of the sheet. 


The rubber sheet acts on the element 6x of the string to exert a force of magnitude, 
dF = knox, where k is a constant, proportional to the displacement, 7, and towards 
the x-axis. Show that the action of the system is 


T L 2 2 
On On 
i dt d —} —T.{—)] —kr’ 
iL*), * (oe) (5) (a) ’). 
and find the equation of motion of the string. Recall that this system was also 
considered in Block I, Section 6.4.1. 


Exercise 4.38 


In previous derivations of the equation for transverse oscillations of a string it was 
assumed to be perfectly flexible, that is, no energy was needed to bend it. Here, 
we include this effect. 


An approximation to the bending energy is obtained by considering the force needed 
to bend an element of the string. Provided the curvature is small the potential 
energy due to bending an element of length dx can be shown to be 5 B(x)n2.,62, for 
some positive function B(x), which reduces to a constant, B, for a uniform string. 


In this case, show that the action for a finite string, fixed at x = 0 and x = L, is 


- L 
S= i} at | dx (p(x)n? — Ten; — Bre.) - 


Show that the Gateaux derivative of this functional for admissible functions satis- 
fying the conditions 

H0,t) = 9 1,4) =—0 forall £, 
and 


mMx,0)=n(x), (2,7) =m(e2) for O<2¢< LZ, 
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| T L T L 
AS|n, g| = | dt / dx | — pr + TeNae - Breve ates | dt eI 


Deduce that if the admissible functions satisfy the additional boundary conditions 
n,.(0,t) = a(t) and 7,,(L,t) = G(t), where a and @ are arbitrary functions of t, then 
the equation satisfied by 77 is 
O*n On 0? 
da Claes 
Ox Ot 


.—— 


= 0. 


Exercise 4.39 


Consider a string with linear density p(x), fixed at the points r = 0 and z= L, 
and with each point moving perpendicular to the x-axis, in the y-direction. A force 
F(x) per unit length acts in the y-direction and is independent of 77. 


(a) Show that the potential energy of the string is given by 


L an 2 L 
v=ar. | dx (52) -| Ge Fix) a. 
2) s Or ‘ ( )n 


(b) Write down the Lagrangian and the action functional for the static string. 
Use equation (4.159) to find the equation of motion and integrate this to find 
an expression for the displacement (a). Evaluate this expression in the case 
F(a) = constant. 


(c) Write down the Lagrangian and the action functional for the vibrating string 
and find the equation of motion. 


4.6 Further Exercises 


Exercise 4.40 


For each of the following systems, define suitable generalised coordinate(s) and state 
the number of its degrees of freedom. 


(a) A particle sliding on the interior surface of a hemispherical bowl. 


(b) A particle swinging in a vertical plane, at one end of a stiff, elastic rod, whose 
other end is fixed. 


(c) A particle swinging at one end of a stiff, elastic rod, whose other end is fixed. 


(d) A particle at one end of a light rigid rod, free to move in a vertical plane, whose 
other end slides freely on a given smooth, rigid curve. 


Exercise 4.41 
(a) Show directly from Lagrange’s equation of motion that the two Lagrangians 
H ¥ 2 - : 
hgad=;4@+o°, bhad=F (+), 
produce the same equations of motion. 


(b) Show that these Lagrangians differ by a total time derivative. 
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Exercise 4.42 


In this exercise we show that a potential function V(r) that depends on the two 
vectors rj and rg through the distance r = |r; — r2g| gives a force on each of the 
particles at r; and reg that satisfies Newton’s third law. 


(a) Using the definition r? = (x1 — x2)? + (y. — ye)? + (21 — 22)”, show that 
Or Or 
es we 5. = #4 ~ 2a, 
and that similar relations hold for (yz, z,), k = 1, 2. 
(b) Using this result and the chain rule, show that 
ON sce Oe ee 
Ox1 Ory Oy Oyo’ Oz Oz’ 


and hence that Fiz = —F21, where these forces are given by equations (4.63), 
that is Fi2 a —OV /Or, and Fo) = —OV/Or2. 


Exercise 4.43 


A set of coordinates, (u,v, @), defined in terms of the Cartesian coordinates (x, y, z) 
by the equations 


L=Uveso, y= wees 5 (u* —v’), 


are sometimes useful when there is a symmetry along the z-axis; these are known 
as parabolic coordinates. Show that 


ip? + ae 3? = (u? +0") (1° + 07) + u2v7¢?. 
Exercise 4.44 


Consider the motion of a single particle, P, in the Oxy-plane with respect to Carte- 
sian coordinates (x,y) and (u,v), where the Ouv-axes are rotating with respect to 
Oxy with constant angular speed 2, as shown in Figure 4.22. 


The position of a point P can be defined by the Cartesian coordinates (x,y) and 
(u,v), with obvious notation. The relation between these two coordinates is 


u=xcosQt+ysinOt, v= —axsin Qt + ycosNt. 
(a) Show that the inverse transformation is Figure 4.22 
x=ucosMt—vsinQt, y=usinONt+vcosNt. 


(b) If V is the speed of the particle in the Oxy coordinate system, so V? = 4? + y?, 
show that V2 = a? + 074+ 9? (u? + v7) — 20(tv — ud). 


(c) If (p,@) are the plane polar coordinates in Ouv, so u= pcos¢, v = psing, 
show that V? = p* + p2(@+ Q). 


Exercise 4.45 


Two blocks, each of mass M, are connected by an inextensible string of length 1. 
One block is placed on a smooth horizontal surface, and the other block hangs over 
the side, the string passing over a frictionless pulley, as shown in Figure 4.23. 


Determine the Lagrangian in the following two cases: 


(a) the mass of the string is negligible; 


(b) the string has uniform mass m. 
Figure 4.23 
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Exercise 4.46 


A particle of mass m slides on a smooth straight wire which is pivoted about a point 
O and rotates in a prescribed manner, in a vertical plane, with variable angular 
speed so that at time ¢ the angle between the wire and the horizontal is a(t). 


Initially the wire is horizontal, so a(0) = 0, and the particle is at rest, a distance 
A from O. 


Show that the Lagrangian of the system is 
Lb = 5m (q° + gq’ a) — mgqsina, 
where gq is the distance of the particle from the pivot. 


In the case where a(t) = Gt, show that the general solution of the equation of 
motion is 


q(t) = Acosh Gt + Bsinh Gt + FP sin (tt. 


Exercise 4.47 


A rigid wire in the shape of a parabola moves horizontally so that the coordinates, 
(x,y), of a point on it are related by the equation 


y = 5a(x— Y(t), 


where the y-axis is vertically upward and ¥ is a known function of time. Using the 
generalised coordinate gq = x — 7(t) show that the Lagrangian for a particle of mass 
m sliding smoothly on this wire is 


mgr (1 + a°q”) _ itmagq? — mq’. 


Exercise 4.48 


Find the Lagrangian and the equation of motion for a particle of mass m slid- 
ing smoothly on the parabolic wire y = a(t)x*/2, where a(t) is a positive, known 
function of the time and the y-axis is vertically upwards. 


Exercise 4.49 


Show that for a free particle the stationary path of the action gives a minimum of 
the action. 


4.7 Harder Exercises 


Exercise 4.50 


A governor is a device for controlling variations in the speed of rotation of an axle. 
An elementary mechanical governor consists of a mass m2 moving on a vertical axle 
AB, and two masses m, attached to it and to a fixed point O on the axle by rods 
of length a, as shown in the diagram. As the whole system rotates about AB with 
constant angular velocity 22, the angle 6 changes; see Figure 4.24. 


Show that the Lagrangian of the system is 
L = a?6? (my + 2mz sin” 0) aaah ax 8+ 2ag(m, + m2) cos 8, 
and that the energy is 


E = a’6? (my + 2mpo sin? 0) = ma? ein* p= 2ag(m 1 + m2) cos 6. 


Figure 4.24 
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Exercise 4.5 


A particle of mass m is attached to a vertical cylindrical post of radius a by a long 
horizontal string, which can wrap itself round the post. If R > a is the length of 
the unwrapped part of the string, show that the Lagrangian is 


ian = a 


Also show that the first integral of the motion is mR?2R? = 2a2E, where E is a 
constant. 


Exercise 4.52 


A particle is constrained to slide under gravity on a smooth wire in the shape of 
a vertical circle with radius R. The wire rotates about its vertical diameter with 
constant angular velocity 2. Show that the Lagrangian is 


Le imR? 0° + mM? R? sin? 6+ mgR cos 8, 


where @ is the angular displacement of the particle from the downward vertical, 
with the centre of the circle as origin. 


Exercise 4.53 


Kepler’s third law 

In this exercise we derive general properties of solutions when the potential energy 
is homogeneous of degree A in q, that is V(kq) = k*V(q); for instance the potential 
V = q?/2 is homogeneous of degree 2. This exercise is interesting because, when 
extended to three dimensions, the same idea can be used to derive Kepler’s third 
law without solving the equations of motion. 


Consider the Lagrangian 


L(q,4) = 3mq@ —V(q), where V(kq) = k*V(q). 


(a) Show that the arbitrary Lagrangians L(q,q,t) and aL(q,q,t), a being a non- 
zero constant, produce the same equations of motion. 


(b) Introduce the scaled variables Q and T by the relations Q = aq and T = Ut, 
a and b being constants, and show that if b = a!~/?, the expression for the 
Lagrangian in terms of Q and Q’ = dQ/dT is 


L(Q,Q') = a (3mQ’? —V(Q)). 


(c) Deduce that if q = f(t) is a solution of the equations of motion for L, then 
¢=ajy (a/ 2~1t) is also asolution. Show, in particular, that for periodic motion 
under such a potential, the period varies as A!~*/?, where A is the amplitude. 


(d) Deduce that in the case of a linear oscillator, where V(q) « q’, the period is 
independent of the amplitude. 


(e) For the one-dimensional Coulomb potential, V(q) «x q~', q > 0, deduce that 
the period varies as A®/?, which is just Kepler’s third law; see page 156. 


Exercise 4.54 


A Lagrangian L(qi, 41,42) is independent of gz, so gz is an ignorable coordinate. 
Use the appropriate form of Noether’s theorem (given on page 120) to show that 
OL /Oq2 = constant. 


Solutions to Exercises in Chapter 4 


Solutions to Exercises in Chapter 4 


Solution 4. | 


Since m, g and a are all positive, EF > 0. Differentiation of E with respect to t gives 


dE 
= MLL (1 os aa) = Sma? (2e° 22) + Mgaxrx 
= ee E (1 + ce) + one" + aga == {j, 
the last equality following from the equation of motion (4.5). Rearranging equa- 
tion (4.6) gives 


dx 2E — mgax? 
2 ae 2 

ma" (1+ a*x*) =2E —mga — = +,/ ———___.. 
( ) aS as m (1 + a*xz?) 


Solution 4.2 


The expression for FE is derived in the solution of Exercise 4.1. Substituting equa- 
tion (4.8) into this gives 
dE 
dt 
Note that «(t) = 0 is the only solution of the differential equation for which E = 0. 


2 a 
= “marry = 5 nays ‘Cai 


Solution 4.3 


If 4 = 0, equations (4.21) and (4.20) reduce to 0+ (g/l) sin@ = 0 and X = 0, re- 
spectively. The first of these is just equation (4.13). The second gives X = a + /[3t, 
where a and @ are constants, so the particle A moves with uniform speed along the 
rail and is unaffected by the swinging pendulum below it, as would be expected for 
a heavy particle. Also, the pendulum motion is unaffected by the uniform motion of 
the support, A: this is an example of Galilean invariance, discussed in Section 4.3. 


Solution 4.4 


Differentiation with respect to t gives 


= =(M+ m)X X + ml?66 + ml (X cos 0 + 6X cosé — 67.X sind + gé sin 8) 
Substituting for (MM +m)X using equation (4.20) reduces this to 

dE i? oe 

oe mle (10 + X cos @ + gsin 8) 


and using equation (4.19) we see that EF = 0, so E is a constant. 


Solution 4.5 


(a) Figure 4.25 shows the two vectors r; and ro, the vector R_ of the centre of mass 
at C, all relative to an origin O. By definition, (m1 + m2)R =m 1r1 + Mero. 
Also, addition of the vectors gives rj + rig —re = 0, r3 + rT; -R =O and 
R — 2 —rg = 0. Thus r; = R—r; and rz = R — ro, and using the centre of 
mass equation to eliminate R gives 


(my + m2)P1 = MT, + tTHers — (my + mM2)¥4 = M2(L2 ae ri), 
Or 


SF pg: 
mi + M2 


Similarly, 


(my + M2)¥2 = Mm 1r1 + taro = (my + M2)re = M4 (ry = ro), 


a ss 
mi 
ri 
O 
Figure 4.25 
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(b) 


(c) 
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or 


My 
[3 ee Fro, 
tly + We 


In the case where m, = mg and mz = mp, we have 


1 149.6 x 10° 
8 ss en WES Tit aoe eae to ppl 
my +m 328901 : 


3.289 x 10 
In the case where m, = mg and mz = mj, we have 


me 1 = 778.3 x 10° ¥ 
ee = TC a, 
mi+m: 1048.4’ Pil = 704g x 103 “lo note 


Solution 4.6 


The angle from any fixed point on the circle. One degree of freedom. 
The perpendicular distance from the plane. One degree of freedom. 


Two coordinates are needed to specify the positions on a two-dimensional sur- 
face. Suitable coordinates for a sphere are the latitude and longitude. Two 
degrees of freedom. 


Since the axis of rotation is known, only one further coordinate is needed to 
specify completely the orientation of the body. This may be taken as the angle 
of rotation about the axis. One degree of freedom. 


The motion of the circle is given, so only one variable is needed to fix the 
position of the particle on the circle. The same variable as in part (a) may be 
used. One degree of freedom. 


If the circle is free to rotate, its motion is affected by that of the particle 
moving on it, so two variables are needed. One of these may be the same as in 
part (a) and the other may be taken to be the angle of rotation of the circle. 
Two degrees of freedom. 


For any body moving in three-dimensional space, whether it is a particle or an 
extended solid body, three coordinates are needed to specify its position. For 
a solid body one would normally take these to be the Cartesian coordinates of 
the centre of mass, with respect to some given, fixed coordinate system. For 
a rigid body a further three variables are needed because it can rotate; these 
can be defined in a variety of ways, for example by using two to defined an 
instantaneous axis of rotation and the third to specify the angle of rotation 
about this axis. Thus the system has six degrees of freedom, or three if the 
centre of mass is fixed. 


Much depends upon how we approximate the system. If we regard the planet 
as a particle, then three coordinates are required to fix its position, which 
could be the three spherical polar coordinates with the sun at the origin; this 
system has three degrees of freedom. If the planet is regarded as a rigid sphere 
rotating about a given axis one extra degree of freedom, the angle of rotation, 
is required. If the structure of the planet is to be taken into account then its 
orientation must be specified as well as its position in space; this requires a 
further three coordinates, as in part (g), giving six degrees of freedom. 


Solution 4.7 


(a) 
(b) 
(c) 


The length of the rope is 2Y + y+ constant, hence the result. 
There is one degree of freedom. 


There are two reasons why the tension along the rope does not change. First, 
the rope is light, so any change in tension in a vertical section would result 
in an infinite acceleration. Second, the pulleys are smooth and light so the 
tension on either side must be the same, otherwise the angular acceleration 
would be infinite. 


Solutions to Exercises in Chapter 4 201 


Solution 4.8 
(a) In this example ¢=u+0 and y=wu-—1J, hence 2U=2%+ 4, 20 = 4-4. 
(b) Squaring and adding these expressions gives 

P+y =(u+0) +(u—d) =2(w? +0"). 
(c) Differentiation gives ¢ = u+0t+v and y =u — vt — v. Squaring and adding 

these expressions gives 
ety? =? + 0747 +? + Qudt + Quot + Que 
+ U + 0°? +0? — Qubt + Quit — Qvu 
= 2 (ua? + ot?) + 4vdt + 207. 


Solution 4.9 

Differentiation gives x = wv + uv and y = ut — vd, so that squaring and adding 
gives the quoted result. 

Solution 4.10 


Since 6s* = 6x” + dy? + higher-order terms, on dividing by dt and taking the limit 


as ot tends to zero, we see that v? = «2 + 9? = 8?. 


The kinetic energy can be expresed in a variety of ways. If y = f(x), soy = f’(x)z, 
we have 


T = 5mv? = 5m (a? +: 9°) = ma? (14 f'(x)”) = dmo?. 


Solution 4.1 | 
The kinetic energy is T = m (7? aa 726?) /2, where we have used the result derived 


in the text, equation (4.54). But since r = a6, 7 = a, so we may eliminate either 
? or r to give, respectively, 


2 
4. 6 . fig 2 42 2 
T = 5mr (+5) or T = 5ma‘d" (14+ 6). 


Solution 4.12 
The total kinetic energy is T = MY?/2 + my?/2. But 2Y + y = constant, so 2Y = 
—y and T = My?/8 + my? /2, which gives the required result. 


Solution 4.13 


(a) Partial differentiation of r? with respect to the independent variables x, y and 
Zz gives 


(b) The chain rule gives OV/Ox = OV /Or x Or/Ox, with similar relations for dif- 
ferentiation with respect to y and z. Hence 
a 22 ieee 
Or \ Ox’ Oy’ Oz) Or 
= Ve. 
Since F = —OV/Or, the result follows. 


= ws 


es 4 
' 2 = rT 


Solution 4.14 
The force is F = —kx = —OV/Or, hence 0V/Ox = kx and V = ka? /2. 
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Solution 4.15 
If z < R we may expand in powers of z/R by writing the potential energy as 


GMm ay 
rae Ga 

GMm z 2 
=F Rl 
West LO cal 
~ ia pie Re)” 


The first term is a constant and can be ignored so, to the lowest order, V = mgz 
where g = GM/R?. Using the given values g = 979.7cms~?. 


The actual value on the surface of the Earth varies with latitude because the Earth 
is not perfectly spherical. At the equator g = 983.221 and at the poles g = 978.049. 


Solution 4.16 


(a) The variation of the acceleration due to gravity, g(h), with the height, h, 
above the mean sea level is derived in the previous exercise, and is g(h) = 
GM/(R+h)?. A first-order Taylor expansion about h = 0 (that is, sea level) 
gives g(h) = g(0) + hg’(0); that is, if we denote g(0) by g, 

2GM Ag_ gih)-g 2h 


h)—g=-h See 
g(h) — 9 fp ee ; R 


Putting R = 6.378 x 10° m and h = 800m gives Ag = —2.5 X 10~“g. 


The rate of change of T’ with g can be written in the form T’(g) = —T/(2g) 
and to first order T(g + Ag) — T(g) = AgT’(g), which gives 


There are 24 x 60 x 60 = 86 400 seconds in a day, so the relative accuracy of the 
clock is AT'/T = 1.2 x 10~°. But the relative change in T due to the change in 
height is, from the above formula, AT/T = h/R ~ 1.2 x 10~* so this change 
can be measured by the clock in less than a day. 


Solution 4.17 
(a) The kinetic energy is T = mz*/2, and since E = T + V the result follows. 


If the initial speed is v, at z = 0, the energy is 
9 GMm _ 1, .2  GMm 
R+z’ 


since F/ is a constant. 


(b) As z— oo and z — 0, the right-hand side tends to zero to give the escape 


or i = 
Co T= R : 


(c) Using the data given for the Earth, the Moon and Jupiter we obtain the fol- 
lowing. 


speed, 


2 GMm 
-— — a 


Jupiter 
59.37 kms—! 
2451 cms~? 


9.37 kms" 
162.4cms~? 


11.18kms! 
979.6cms~? 


Solution 4.18 
Since T = mv?/2 and mv = F, we have T = mvi = mv-v =v-F, since 


d d d 
ve=v-v giving to = ey) Se Ee 


dt dt 
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Solution 4.19 


In this example we have 
OL ee. - 3S 
= “a 
and 
ab __ ab_ wv ab__av 
Ox Ox’ Oy Oy’ Oz Oz 
Hence Lagrange’s equation for the x-coordinate is 
d (OL OL dct OV 
a (5) ~ ae =" ap + oe = 
which reduces to the quoted equation: since —OV/Oz is the force in the x-direction, 


this is just Newton’s equation of motion for this coordinate. The results for the y- 
and z-directions follow similarly. 


Solution 4.20 


(a) First note that 
OV OVOr OV OV Or 
ae ee ay ae 
and since Or/Ox = x/r and Or/Oy = y/r (see Exercise 4.13(a)), Lagrange’s 


equations of motion in Cartesian coordinates are 


di). ee ee ae al 
ap ey ee) = and =O. 


(b) Since 4? + y? = 7? + r26? (see equation (4.54)), the Lagrangian becomes 
L=sm ie ~ 6?) —V(r). 

This is independent of 0, so this equation of motion becomes 
d (OL d 4 A 
— (| =— (mr?) =0, hence 6=—s, 
dt (F) dt Ga —— mr 
where A is a constant. The equation for r is 

d (OL OL ; 

OT ae ee ee. Wa) 

dt (z) Cem ae 
Using the equation for e gives the quoted result. 


Solution 4.21 
Since f(x) = ax? /2, f’(x) = az and f(x) =a, equation (4.83) becomes 
= (a(1+a*x")) — a’ aa” + agz = 0, 


which expands to #(1+a?x?) + 2a?xi? — a?xi? + agx = 0 and is the same as 
equation (4.5). 
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Solution 4.22 


(a) 


If the length of the string between the particle of mass m; and the pulley axis 
is z, and the total length of the string is /, then the length of the string between 
the particle of mass m2 and the pulley is 1—aR-—z. Thus the potential 
energies of the two particles are Vj = —mj gz, and V2 = —mgg (l—7R — z). 
Notice that the signs are negative because distance is measured downwards. 
The total potential energy is 


V(z) =V4+ V2 = —(m, — m2) gz — mogl + TMagR, 
and the last two terms may be ignored because they are constants. 


When the mass of the pulley is neglected, the kinetic energy is just the sum 
of the kinetic energy of the particles T = m1 27/2 + m22?7/2 = (m, + m2) 27/2, 
so the Lagrangian is 
| 5 (m4 + M2) a = (my ie M2) gz. 

If the mass of the pulley wheel cannot be neglected then its rotation will 
add a component to the kinetic energy, but not to the potential energy. 
The extra kinetic energy is given in the question in terms of 0; since the 
rope does not slip we have the relation R@ = z, so the new kinetic energy is 
T = (m,+m2+M,) 27/2 and the Lagrangian is 


L= 5 (my + m+ Mz) 27 +(m, — M2) gz. 
With M =m, + m2+ M, we have 0L/0z = (m1 — m2)g and OL/0z = M3, 
so Lagrange’s equation of motion is 

Mz= (m4 a m2)g, 
with general solution 
1m, — m2 
= 20 O)t + —————— 
z= 2(0) + 2(0)t+ 5 
As would be expected, if m , > m2 the particle with mass mz moves up, and 
if m, < mg it moves down. Also, the heavier the pulley wheel the less the 
acceleration. 


gt? 


Solution 4.23 


(a) 


In this example the motion is along a line in three dimensions so the speed v 
is obtained from v? = 4? + y? + 42 = (k? +a”) ¢?. The kinetic energy is thus 
T = m(k? + a?) ¢?/2. The potential energy is V(¢) = mgz = mgké, giving 
the Lagrangian 


L=T—V=im(k? +a’) ¢? —mgk¢, 


which is similar to the Lagrangian of a particle falling in a gravitional field. 

Lagrange’s equations of motion are (k? - a?) o+g9k=0. Direct integration 
. ; gkt? 

of this equation gives o(t) = 6(0 0)t — —.—_—~ 

quation gives 4(t) = $(0) + 4(0)t— 575m 

particle slides down the spiral with ever increasing speed. 


, which means that the 


When the spiral is rotating the coordinates of the particle are 
x=acos(¢+O@(t)), y=asin(¢+O(t)), z=kd. 
For a rotating wire the potential energy is unaffected by the rotation, but the 


; .\ 2 , 
speed is now given by v? = a? (4 + 6) + k*¢? giving the Lagrangian 
L=5m (a? + k*) o + 20°60) — mgké, 


where we have, as usual, ignored the term m6? /2 because it is a known function 
of time. 
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This Lagrangian can be cast in a more convenient form by removing the de- 
pendence on ¢, using a trick discussed later, on page 178. Observe that 


ee ae 4 
0 = —(0) — $0 
36 = — (46) — 06, 
so the Lagrangian becomes 
L= 5m (a° - k:*) 6? —m (gk = 06) @t+ ma? < (66) 


On page 178 we show that the last term, being a derivative with respect to 
time, may be ignored to give the simpler, equivalent Lagrangian, 


sm (a° +k?) ¢? —m (gk + a8) Q. 


The equations of motion are obtained using the derivatives of the first La- 


erangian, 
OL | OL 
a? +k?)d+ma?20 and — =-—mgk, 
ab ad Oo : 
which give the Lagrangian equations of motion 
m (a? +k?) 6 +ma*d + mgk =0 and pe eae (07d + gk). 


Solution 4.24 


(a) For a T-periodic solution z(t + T) = x(t), for all t. Ifz(0) = A, then x(T) = A 
and the action is 


= [a dt (4ma* — ‘ma*x*) , eI) 22 a) ae 
(b) If e = Asinwt, w = 27/T, the functional becomes 
27 /w 27 /w 
f= smA?w / dt cos? wt — ;mA*a . / dt sin* wt 
0 0 


2a Ata 2 21 
= SmwA? / dé cos? 6 — - = / dé sin* 6 
0 W 0 


ze = (8w* A? — 3a” A*) 


(c) The functional must be stationary, so S’(A) = 0, that is, 16w*.A — 12a?.A® = 0. 
So 
2 WwW 
=a or A= ae 
Solution 4.25 


The constant is 


B=#2!-L= ma? (1+ f’(x)?) — gma? (14+ f'(x)*?) + mgf (2), 


which reduces to the given expression. Rearranging this gives 


dz\" _2E — 2mgf(z) 
m(G) ar 


Solution 4.26 


The equation of motion is 
(1+ f'(a)’) +#°f'@)f"(a) + G+) F@) =0 
and if f = ax?/2 this becomes # (1+ (azx)?) + a?x%? + (g + ¥) ax = 0, as required. 
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Solution 4.27 


(a) It is shown in Exercise 4.10 that v? = 87, thus T = mv?/2 = mé?/2. The 
potential energy is V = mgy(s), so the Lagrangian is L = més?/2 — mgy(s) 
and the equation of motion is ms + mgy’(s) = 0, hence § + gy’(s) = 0. 


(b) On the cycloid 
_ wi dx \* dy 
tor Pao) + (35 
p 
~ a | do \/(1+ cos ¢)? + sin? ¢ 
0 
p 
= a | dd \/2(1 + cos ¢). 
0 


But cos ¢ = 2cos?(#/2) — 1, hence 


p 
= 2a | dd cos(@/2) = 4asin(¢/2). 
0 


Using the related trigonometric identity, cos¢ = 1 — 2sin?(¢/2), we obtain 
y = 2asin*(¢/2) = s?/(8a). Hence, from part (a), the equation of motion is as 
given. 


(c) The general solution of the equation of motion is 


8(0) |. | 9 
t) = s(0) coswt + ~sinuwt, w=,/—. 
s( ) s( ) cos Sl , din 


Thus s(t) = s(t +7), T = 27/w, for all valid initial conditions. That is, the 
period of the motion is independent of the initial conditions: no matter where 
the particle starts, it reaches the bottom in the same time. 


Solution 4.28 


The kinetic energy is found in Exercise 4.12 (page 169). The potential energy is 
V =—-(MY + my)g and, since 2Y + y is constant, the potential energy becomes, 
apart from a constant, V = (M — 2m)yg/2, giving the quoted Lagrangian. 


The equation of motion is (M+ 4m)y = 2(2m — M)q; integrating twice gives the 
general solution quoted. 


Solution 4.29 


(a) Using the expressions given for T and V we obtain the quoted expression, 


ft . ; 
if = ma —~L=ml’?* — (3mi?6? + mgl cos 0) ee +ml?? — mgl cos @. 


The minimum of F is at 6 = 0 and the maximum of mgl cos @, that is 0 = 0; 
thus £ > —mgl. 


(b) Using the half-angle formula, the energy integral becomes 
—mgl(1 — 2k") = iml?6? —mgl [1 - 2 sin? (0/2)| , 
which can be rearranged to give 62 = 4g (k? — sin*(@/2)) /l. Since 0 is real, we 
need | sin(@/2)| < k; if0 < k < 1, this means that |@| < a < 7 where sin(a/2) = 
C 


Since @ increases from —a to a, 8 > 0, and hence 


a. a ae” 
ha / k? — sin*(6/2), 


and integration gives 


/ dé poe a = fer. 
a  4/k2 — sin? (0/2) 
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(c) First note that the values of the contributions from the intervals (—a,0) and 
(0, a) are identical, so 


1 


a 1 pL pres ge 
T=2,/— | dé —————. = 2, | - do 
al \/k2 — sin? (0/2) al “ k2 — sin? (0/2) 


But 
1 d 
ee =kcos¢, so ew = we eee 
2 de dp 1 — k? sin? 
hence 


ae ia 1 
T=4 - | ds a a Baas 2): 
g Jo /1 —k? sin? ¢ 


Now use the binomial expansion (1 — z)~1/? = 14 z/2+3z?/8 + O(z?) to write 
the expression in the form 


Oe aie ak Te oe ’ 
T =4,/- dp {1+ — sin* ¢+ —sin* $+ O(k”) 
JG 2 re) 


and use the integrals, given in Exercise 4.24, but with modified integration 
limits, 
a/2 T n/2 37 
dé sin? d = -, / dé sin 6 = —, 
[ aesimto=F, [ dosinto= 2 
to give the quoted result. 


Note that for small k the relative difference between this expression and the 
small amplitude limit, k = 0, is 1% when k?/4 = 1/100, that is k = 2/10 or 
a ~ 4/10, which is a swing of about 23°. 


Solution 4.30 

At Po we have (6,0) = (09,0) and at P, (6,6) = (0,61), so the energy equation gives 
—mgl cos 99 = iml?67 —magl. 

Thus the speed at P, v = 16;, is given by 5u* = gl(1—cos9). But cosu = 1— 

2sin*(u/2) and elementary trigonometry gives d/(2l) = sin(@o/2), so 


$v? = 2gl sin* (9/2) x 2S = 


Solution 4.31 
The total energy is the sum of the kinetic and potential energies, 
E=3(M+ ve kee = iml?6? + ml@X cos 6 — mgl cos 6. 


This result can also be obtained from equation (4.84). Dividing by M +m, putting 
E=C(M +m) and using the relation X = A — l@cos@ gives 


: 2 : : ; 
C= 5 (A — pl cos 0) “ $l? 0° + pl cos é (4 — pl@ cos 0) — pgl cos 6 


= 4A? + Syl?6? — 117176? cos” 6 — pgl cos, 

which gives 

20 — A? | 
Quiz 


16? (1 — Lcos? 0) — = cos = 
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Solution 4.32 
The equation (4.104) for X can be written as 


X=A-wpl “(sin $) which integrates to X = C+ At-—plsin@. 


The initial conditions give A = 0 and C = plsin @, hence X = pl(sin @ — sin 6). 


Using the initial conditions in equation (4.107) gives B = —(g/l)cos@9, and on 
using the identity cos u = 1 — 2sin*(u/2) this becomes 


62 — 4g sin* (80/2) — sin? (6/2) 


l 1 — pcos? 6 


If |99| < 1, then using the approximations sin u ~ u this becomes 62 + w26? = Ww" 62 
where w = ,/g/(I(1 — y)). The solution of this equation is that quoted. For X(t) 


we have, for small 6, 
X(t) = pl(09 — 8) = plAo(1 — coswt), 
which gives the quoted result after using the identity cos ¢ = 1 — 2sin?(¢/2). 


Solution 4.33 


(a) If both angles are small, on using the approximations given, we see that equa- 
tion (4.120) becomes 1,6) + ul202 + g@, = 0 and that equation (4.116) becomes 
1,0, + [262 + g@2 = 0, with p as quoted. 


(b) Using the substitution given, these equations become 
Ay (wl; af g) — UwAglow” = 0, Aylyw* - Gap = g) Ao =a 
These equations, for A; and Ag, have non-trivial solutions only if 
iw? oh=¢ 
2 2 =, 
Wl —g = plow 

that is, if 

w*lyle(1 — p) — w* (ly + le)g +g? = 0. 


Putting /; = lz =1 we see that the solutions of this quadratic for w are 


eee ee 
Ll l—-yp 


g 1 


> fe ie 


If w is taken to be either of these values, then 


, thet is,. oe 


= oe ee 


ye g Aj g Aj 
A A = 
: plow? Ll ( 


Hence, putting A; = a we have 


1 
0; =acosat, t= reo coswt where w* = ss ’ 
il as 


Ji 
Solution 4.34 


The coordinates of the pendulum mass with respect to the point of support are 
(Jsin@,—Icos@), so with respect to a fixed point the coordinates are (Isin@ + 
y(t), —lcos@). The kinetic energy is 
2 2 
a # sm (idcosd +4) = (16 sin 8) | 
(ee eo ). Lo 
= 5ml** + mly— (sin 0) + smy 
d 
= tml? 6" + ml (—Asino + 34 (sin )) + im 
The potential energy is V = —mglcos@, so the Lagrangian can be taken to be 


i= kml?" — ml7¥sin @ + mgl cos 6, 
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and the equation of motion is as given. 


Solution 4.35 


Consider Figure 4.16 (page 183): the coordinates of the point P are as in equa- 
tion (4.94), that is x = I(t) sin@ and y = —I(t) cos 6. Differentiating with respect to 
t gives 


a= isin @ + 16 cos8, ns —icos@ + 16 sin 0, 


hence the square of the speed is v2 = 1? +1262. Since the kinetic energy..is. T= 
mv*/2 and the potential energy is V = mgy = —mgl (t) cos 6, the Lagrangian is 


Ls iml(t)?6" + mgl(t) cos @ + tml’. 


The final is a function of time only, so may be ignored. Differentiation gives 
OL / 06 = mi?6 and OL /00 = —mgl sin 6, which gives the quoted equation of motion. 


Solution 4.36 


From equation (4.145) we see that F = SMe — 5 Mz» 80 

oF. OF | oe 

On, a Pt) On, nee eles On 77 
Hence, equation (4.159) gives pn, — Ten, = 0, which is the wave equation. 
Solution 4.37 


Each element of the string, of length éz, has an extra potential energy kn*da/2 
because of the energy stored in the stretched rubber sheet. Hence the action is that 
given in equation (4.145) with the extra potential energy subtracted from it, that 


is 
2 L 
S=} f dt [ax (on? — Terk - i) 
8) 8) 


Using the general result (4.159) this gives the equation of motion pn, — Ten,» + 
kn = 0: this equation also occurs in quantum mechanics where it is known as the 
Klein—Gordon equation. 


Solution 4.38 


Using the potential energy given in the question we see that the total potential 
energy of bending is 


Including this contribution, the action (4.145) gives the result quoted. 


Consider the Gateaux differential of the bending contribution, Sg, to the action: 


ASBln, 9 J=-B a [ 2 a 


Integrating by parts with respect to x twice and using the conditions g(0,t) = 


g(L,t) = 0 gives 
Bf ww [ dey 


If there are the additional boundary conditions then g,(0,t) = g,(L,t) = 0 and 
the boundary term vanishes and the stationary path is determined by the given 
equation. 


ASp\n, g| i -B | dt heads] 
0 
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Solution 4.39 


(a) The potential energy comprises the sum of the component due to work being 
done against the tension, found in the text, 


i 2 
Vi = ar, | dx (3) 
0 


and the work done against the force F(x). For an element of the string, length 
dx, this potential is 0V2 = —F(x)ndx. Compare this with the potential gained 
by lifting a mass m a distance 7 in a gravitational field, where the force is mg, 
so the potential energy gained is —mg7n. 


Hence the potential energy is 


V= ye [ax (2) — [ae ran) 


(b) The corresponding action is 


ce p fa fo ae ae (@) 2, (st) + BF) | 


For a static string, 7, = 0 and all functions are independent of t, so the action 
simplifies to 


s= 5 fa fa |-n (32) +2rem). 


Using equation (4.159) we obtain the equation of motion T.7,.,. + F(x) = 0. 
Integrating this once gives 


On ae is 
a B-#] du F(u), 


where B is a constant. Integrating again gives 


x VU 1 x 
na) =A+Be- = | aw | du F(u) =A+Ba~ > | du (x —u)F(u), 
Te 0 0 Te 0 


where A is another constant. The boundary conditions, 7(0) = 7(L) = 0 give 
A=0Oand BL — “5 i du (L —wu)F(u) = 0, respectively, so that 


L 


=f au(t— wr) f° du(e— wre 


If fF =k, for some constant k these integrals give n = ka(L — x)/(2T.). 


(c) Using equation (4.159) we obtain the equation of motion 


0?n Orn 
p(t) ae = we) = F(z) me, 


Solution 4.40 


(a) On any surface two coordinates are needed to specify a position, so this system 
has two degrees of freedom. Possible coordinates are the latitude and longitude. 


(b) If the rod were rigid the system would be a simple pendulum, with one degree 
of freedom. Because the length of the rod can change another coordinate is 
required to specify the position of the particle, so the system has two degrees 
of freedom. Two possible coordinates are the angle between the rod and the 
downward vertical and the length of the rod. 


(c) The only difference between this and the previous example is that the end 
of the rigid rod moves on a sphere, not a vertical plane, and requires two 
coordinates to define its position. Thus for the elastic rod, three coordinates 
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are needed and the system has three degrees of freedom: possible generalised 
coordinates are the two spherical polar angles of the rod. 


(d) If the end of the rod on the curve were fixed we should have a conventional 
vertical pendulum, with one degree of freedom. Because its hinge can move 
this adds a second degree of freedom. Coordinates could be the angle between 
the rod and the downward vertical and the distance of the hinge along the 
curve. 


Solution 4.4] 
(a) The equation of motion for Li is d(q+q) /dt—(q¢+¢q) or g—q=0. The 
equation of motion for Lz is d(q)/dt — q or gd—q=0. 


(b) We have 
d 
I1=4(¢ +4) +g Be +o a.) 


Solution 4.42 
(a) The independent variables are the coordinates (x1, y1, 21) and (2, y2, z2). Dif 
ferentiate r? with respect to 2; and x2 to obtain 
O 
its = 2(4%1—22) and or = —2(x1 — 22), 
which gives the quoted results. Differentiation with respect to (y1,y2) and 
(z1, 22) gives similar relations. 
(b) The chain rule gives 
av _aV dr OV _ av ar aV _ av ar 
Oc, Or Ox,’ Oyn Or Oy,’ Oze Or Ozp’ 
Since Or/Ox, = —Or/Oxz, etc., we see that OV/Ox, = —OV/0Z2, etc. Hence 
OV 7 (A= Y1 — Y2 4= 32) - None 


ely. 


i a Ge 
- Or, Or r+ r r Or 


and Fo; = —OV/Or2 = —Fi2. 


Solution 4.43 
The generalised coordinates are u, v and @, and differentiation gives 

£ = (uv + uv) cos¢ — uvd sin d, 

y = (uv + uv) sing + uvd cos ¢, 

z= UU — V0. 
The speed is v? = &? + y? + 27; it is easiest to first evaluate £2 + y?: 

e+ 9° = (av + uv)? cos? @ — 2 (tv + ud) uvdsin cos d + u?v¢? sin? ¢ 

+ (wu + ub)? sin? d + 2 (tv + ud) uvdsin bcos ¢ + u2v7¢? cos? d 
x2 
= (iw + ub)? + (wvd) 
Hence, on adding 27 to this, we obtain 
se 
oy? +22 = (Wwtut) + (wvd) + (ut — vd)? 
= (u? + 0°) (1? +07) + u2vo?. 


This is asum of the squares of the generalised velocities, with coefficients depending 
upon the generalised coordinates. 
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Solution 4.44 


(a) A neat way of obtaining the inverse transformation is to use matrices. The 
transformation is linear and can be represented as follows 


Cy )= R00 (F ) RO=( Sine cove) 


(5 )=mor'(2). aor= (358 “ME 


Hence 


So 


x=ucosQt—vsinOt, y=usinQt+vcos Nt. 


(b) Differentiating these expressions and putting c = cos Qt and s = sin Qt gives 
g=uc—vs+QQ(—us—vc) and y=us+b0c4+Q(uc — vs). 
Squaring and adding these gives 
£ + y? = (tic — bs)" + (us + 0c)" 
cy [(—us —yc)* + (uc — vs)” 
+ 20 [(uc — vs) (—us — vc) + (ts + 0c) (uc — vs)]. 
Expanding all these terms gives 


V2 = 40749? (u? + vw) — 20(tv — uv). 


(c) We have u = pcos¢ — po sing, v = psin@+ pd cos ¢, giving 
wtv=p, w+ =p? +(p¢)?, tw—ud =—p¢, 
hence 


? , , 2 
V? = p? + (pd)? + 2076 + 27 p* = p* + p* (2 + Q) , 


Solution 4.45 


We consider both situations together. If / is the length of the string and z the 
distance of the hanging block below the pulley then, because all components are 
moving with speed z the kinetic energy is 
T= 5M24+5M2? + 5m2’. 

The potential energy of the hanging block is —Mgz (because the distance is mea- 
sured downwards). The mass of string above the pulley has potential energy that 
is independent of z. The mass of string below the pulley is mz/Il, and the centre 
of mass of this is a distance z/2 below the pulley, so the potential energy of this 
portion of the string is g x (mz/l) x (—z/2). Hence the total potential energy is 


mg 9 
V = —Mgz - — 
= 
and the Lagrangian is 
L=3(2M +m) + Mgz+ oo 


In the first case the string has zero mass so L = Mz? + Mqz. 
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Solution 4.46 


Let g be the distance between the particle and the origin, so its coordinates are 
x = qcosa and y = qsina. The square of the speed is 


v* = (q¢cosa — qasin a)? + (qsina@ + qacos a)? =(@t+q°a’. 

The potential energy is V = mgy, so the Lagrangian is 
L= 5m (q° f qa) — mgqsina. 

The equation of motion is Gg — ga* + gsina = 0 and with a = Gt this becomes 
q—-@o= gsm ft, 1920. 


The solution of this linear equation comprises two parts; first there is the general 
solution of the homogeneous equation g — g8” = 0, qi (t) = Acosh(Gt) + B sinh((t). 
Then there is a particular solution of the inhomogeneous equation, which is clearly 
of the form q,(t) = C'sin(Gt) for some constant C. This is found by substituting 
into the equation of motion: 


C (-6? — 6°) sin(6t) = —gsin(St), so C=g/26", 
giving the solution satisfying q(0) = A, 
q(t) = Acosh(Gt) + Bsinh(Gt) + FP sin ((t). 


Solution 4.47 


The speed of the particle is given by 
‘ . ‘ ze 
v=a?+y? =(¢+4) + (cag) 
@ (1+a°q’) + 247+ 7 
: e. . 
=P Tl are) = 2a 4 + shia @’)| : 


where the terms in the square brackes can be ignored because they are time deriva- 
tives. The potential energy is V = mgy, hence the Lagrangian is 


i = img (1 + a°q") — tmagq — mq’. 


Solution 4.48 
The square of the speed is 7 + y* and since y = ax& + ax? /2 we have 
ye = g* 4+ (axa + 1 a?)* = j? (1 ss a gl + aax?e + Fara’, 


so the Lagrangian is 


L= ma? (1 + a*z*) + kmaax?& + ima x* a2 mga”. 
But, 
dx* d d 
ae 1 - 4 te A 
aax’ xs = -aa— = =— (aazx”) — 7x — (aa 
a ce = dt ~~ 


so that, on ignoring the first term, the Lagrangian becomes 


to 22 1 a er , da* da\” 
L=5mz (1 + a*x*) — 5mgax —sma it), where Te le 


The partial derivatives required for the equation of motion are: 


OL OL 
—_— = mz (1 oa aa")  —_—= maze” — mgax — m7y(t)x”, 


Ox Ox 


so the equation of motion is 


a 4; , 
ee (@ (1 + a°x?)) — a?xt? + gax + $my(t)x” = 0. 
Expanding this gives 


@(1+a°x*) + 2aax*é + a®xt? + gar + 47(t)x® = 0. 
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If ad = 0 this reduces to equation (4.5) (page 147). 


Solution 4.49 


The action is 
i) 
S= ym | dt (47 + 9° + 27). 
ty 


If (x(t), y(t), z(t)) is the actual path and (x(t) + € €(t), y(t) + €n(t), z(t) + € C(t)) is 
an admissible varied path, then (€,7,¢) are zero at t = t; and tg and the change in 
the action is 


ssa fa (i+) -a"+ (a +69) — 9+ (z+) -# 


=me f dt (#é + 9 + 26) the? [at (P49 +2). 


ty 
But integration by parts gives 
to : to 
/ dt £€ = |€z|," -| dt Ex. 
ty ty 


Since €(t,) = €(t2) = 0 and # = 0, because there are no forces, this integral is zero. 
Similarly, the integrals over y7 and z¢ are zero. Hence 


ta . . 
6S = yme? | dt & + 7° + C) 
ty 


Thus 6S > 0 (unless |€| + |7| + |¢| = 0), so the action is a minimum on the stationary 
path. 


Solution 4.50 
Use the notation defined in the figure and put the z-axis along the governor axis 
BA. The heights of the masses m, and mg are z; = —acos@, z2 = —2acos6, so 


that the total potential energy is 
V(0) = 2mjgz1 + Mogz2 = —2ag (m1 + m2) cos 8. 


Since the particle of mass mz moves only along the z-axis its kinetic energy is 
io = 51225 = 2m a6? sin? 9. The motion of either one of the particles of mass m1 
is more complicated as it moves in three dimensions. Consider the one on the right 
in the figure. In the plane of the paper it has the vertical velocity component z1 
and also the horizontal velocity component x, where 7; = asin@; there is also a 
component of velocity into the paper due to the rotation of the system about the 
axis, which is 7,92 . Thus the speed v1 of this particle is given by 


v2 = a76? sin? 6 + a6? cos? 6 + a7? sin? 6 = a? (6? + 0 sin? 0) 


The speed of the other particle of mass m, is clearly the same. It follows that the 
combined kinetic energy of these particles is T; = m,a? (6? + 0? sin? 0) and hence 


the Lagrangian is 
L(6, 0) = m,a" G AAFP sin 0) + 2ma26? sin? 6 + 2ag (m1 + m2) cos 


— 7 (my + 2mz sin” 0) + mj a7? sin? 6 + 2ag (m1 + m2) cos 6. 


The energy, FE, is given by the first integral 


-OL 
E=0—-L 
OO 
= 2a76? (my + 2mz sin? 0) —L 


= q?6? (my + 2mz sin? 0) — mj a7? sin? 6 — 2ag(m, + m2) cos 6. 
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Solution 4.51 


The string is always tangential to the surface of the perpendicular, cylindrical post; 
let 9 be the angle between the radius to this tangential point and some fixed radius, 
as shown in the diagram, and R(t) the length of the string from this point to the 
particle. 


The distance R depends upon @: the total length of the rope is, to within an additive 
constant, R + a0, so R = —a6. We now determine the coordinates of the mass m 
with respect to the axes Oxy, shown in Figure 4.26. 


Point P, where the rope leaves the cylinder, has coordinates (asin 6,acos@). The Figure 4.26 
x and y distances of the mass from P are, respectively, Rcos@ and —Rsin@, so 

the coordinates of the mass are (asin@ + Rcos6,acos@— Rsin@). The velocity 
components are 


& = (acosé— Rsin#)6+ RcosO and y=-—(asind + Rcos6) 6 — Rsind, 
so that the square of the speed of the mass is 

yo = (a° + =) 6? + R? + 2aR0. 
But 6 = —R/a so this reduces to v? = (RR/a)?. 


The kinetic energy is therefore T = m(RR)?/(2a?). Since the gravitational forces 
are to be neglected the potential energy is constant and the Lagrangian is L = T. 
The first integral, equation (4.84), is 


-OL j 
ee ae ee 
OR 2a? 
Solution 4.52 
Take the origin O to be the centre of the circle and the generalised coordinate to Q 


be @, as shown in Figure 4.27. The potential energy is V(@) = —mgRcos@ and the 
speed of the particle is v? = R70? + (ORsin 6)’, the last term being the component 
of the velocity due to the rotation. The Lagrangian is 


| », 
L= mR? 6? + Em? R? sin? 0+ mgR cos 0. 8 


Figure 4.27 


Solution 4.53 


(a) Lagrange’s equation for the Lagrangian aL is 


af OL) ob _ fee | 4g 
dt \ 84 Og \dt\ dg) Oa) — 


Thus a solution g(t) of Lagrange’s equation for L is also a solution of Lagrange’s 
equation for aL, and vice versa. 


(b) If Q = aq and T = dt, then 


. dq _ d(Q/a) _ bdQ 
V(a) =V(Q/a) =a "V(Q) and = Ger) = Gat” 


and the Lagrangian is 


LT (2. x) = L(q,4) = 3m (=) (2) -a¥@) 


2 
-a* (im (3) — v1a)] if b=al-/?, 


(c) If q= f(t) is a solution of the original equation of motion for L(q,q) then 
Q = af(T/b) is a solution of the equation of motion for L(Q,dQ/dT). Using 
the result of part (a) we see that when b = a!~*/?, Q = af (T/b) is a solution of 
the equation of motion for a*L(Q,dQ/dT), and since this is formally the same 
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as the original Lagrangian we conclude that q = af(t/b) will be a solution of 
the equation of motion for L(q,q). But, in this case 


af(t/b) =af (a*/2-14) 


In particular, if f(t) is a periodic solution with unit amplitude and period 7 
then Af(A*/2~1t) is a periodic solution with amplitude A, and its period is 
A)7*8; 


(d) If V(q) x q? then \ = 2 and A!~*/? = 1. Therefore the period is independent 
of the amplitude in this case. 


(e) If V(q) x q~! then \ = —1 and A!~*/? = A3/?, so the period increases as the 
3/2 power of the amplitude. 


Kepler’s third law of planetary motion states that the square of the period 
of each planet in its elliptical orbit is proportional to the cube of the orbit’s 
semi-major axis. This is analogous to the 3/2 power relationship between 
period and amplitude obtained above for one-dimensional motion; of course 
the gravitational potential varies as r~', where r is the planet’s distance from 
the Sun. The following table of the mean distances a of each planet from the 
Sun and their periods confirms Kepler’s third law. 


Planet Mean distance a Period t at 2/3 
million miles AU (years) 

Mercury 0.3871 | 0.2409 
Venus 
Earth 
Mars 
Jupiter 
Saturn 
Uranus 
Neptune 
Pluto 


Solution 4.54 


In this example the independent variable, t, is unchanged so ¢ = 0, in equa- 
tion (3.131) (page 119), ~, = 0 and w, =constant: here we have used the correspon- 
dence y1 = q1, yo = qg and x = t. Hence, equation (3.133) becomes OL /Oq2 = constant, 
which is the integrated form of Lagrange’s equation of motion for qo, that is 


yd (aL) ab _a (at 
dt \ Ode Oq2 dt \ 0g.) ° 
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